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PREFACE 


This  report  was  prepared  by  Dr.  William  W.  Walker,  Jr.,  Environ¬ 
mental  Engineer,  Concord,  Mass.,  for  the  U.  S.  Army  Engineer  Waterways 
Experiment  Station  (WES)  under  Contract  DACW39-78-0053  dated  7  June  1978. 
The  study  forms  part  of  the  Environmental  and  Water  Quality  Operational 
Studies  (EWQOS)  Work  Unit  IE,  Simplified  Techniques  for  Predicting  Reser¬ 
voir  Water  Quality  and  Eutrophication  Potential.  The  EWQOS  Program  is 
sponsored  by  the  Office,  Chief  of  Engineers,  and  is  assigned  to  the  WES 
under  the  purview  of  the  Environmental  Laboratory  (EL) . 

The  study  was  under  the  direct  WES  supervision  of  Dr.  Robert  H. 
Kennedy  and  the  general  supervision  of  Mr.  Donald  L.  Robey,  Chief,  Water 
Quality  Modeling  Group;  Dr.  Rex  L.  Eley,  Chief,  Ecosystem  Research  and 
Simulation  Division;  Dr.  Jerry  Mahloch,  Program  Manager,  EWQOS;  and 
Dr.  John  Harrison,  Chief,  EL. 

The  Commanders  and  Directors  of  WES  during  this  study  were 
COL  John  L.  Cannon,  CE,  and  COL  Nelson  P.  Conover,  CE.  The  Technical 
Director  was  Mr.  Fred  R.  Brown. 


This  report  should  be  cited  as  follows: 

Walker,  W.  W. ,  Jr.  1981.  "Empirical  Methods 
for  Predicting  Eutrophication  in  Impoundments; 
Phase  I:  Data  Base  Development,"  Technical 
Report  E-81-9,  prepared  by  William  W. 

Walker,  Jr.,  Environmental  Engineer,  Concord, 
Mass.,  for  the  U.  S.  Army  Engineer  Waterways 
Experiment  Station,  CE,  Vicksburg,  Miss. 
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CONVERSION  FACTORS,  U.  S.  CUSTOMARY  TO  METRIC  (SI) 
UNITS  OF  MEASUREMENT 


U.  S.  customary  units  of  measurement  used  in  this  report  can  be  con¬ 
verted  to  metric  (SI)  units  as  follows: 


_ Multiply _ 

acres 

acre-feet 

cubic  feet  per  second 
Fahrenheit  degrees 
feet 
inches 

miles  (U.  S.  statute) 
pounds  (mass)  per  cubic  foot 
square  miles 
tons  (2000  lb,  mass) 


_ By 

4046.873 

1233.482 

0.02831685 

5/9 

0. 3048 
2.54 

1.609344 

16.01846 

2.589988 

907.1847 


_ To  Obtain _ 

square  metres 

cubic  metres 

cubic  metres  per  second 

Celsius  degrees  or  Kelvins* 

metres 

centimetres 

kilometres 

kilograms  per  cubic  metre 
square  kilometres 
kilograms 


*  To  obtain  Celsius  (C)  temperature  readings  from  Fahrenheit  (F)  read¬ 
ings,  use  the  following  formula:  C  =  (5/9) (F  -  32).  To  obtain  Kelvin 
(K)  readings,  use:  K  =  (5/9) (F  -  32)  +  273.15. 
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EMPIRICAL  METHODS  FOR  PREDICTING 


EUTROPHICATION  IN  IMPOUNDMENTS 
PHASE  I:  DATA  BASE  DEVELOPMENT 


PART  I :  INTRODUCTION 


1.  This  report  documents  the  development  of  a  data  base  describing 
certain  water  quality  aspects  of  reservoirs  operated  by  the  U.  S.  Army 
Corps  of  Engineers  (CE) .  The  data  base  includes  information  on  project 
location,  morphometry,  water  quality,  hydrology,  and  sedimentation.  As 
part  of  the  Environmental  and  Water  Quality  Operational  Sutdies  (EWQOS) 

Program  being  conducted  by  the  Office,  Chief  of  Engineers,  U.  S.  Army, 
this  work  has  been  conducted  to  provide  groundwork  for  assessing  empiri¬ 
cal  approaches  to  describing  and  predicting  reservoir  trophic  status. 

2.  One  basic  strategy  employed  in  assembling  this  data  base  has 
been  to  utilize  existing,  centralized  sources  of  information  first. 

These  have  included  nationwide  data  bases  maintained  by  various  federal 
agencies,  as  well  as  a  few  sources  of  data  tabulated  at  a  regional  level. 

A  framework  has  been  designed  and  implemented  for  storing  and  accessing 
this  information,  with  flexibility  for  updating  and  general  access,  so  as 
to  meet  the  specific  objectives  of  this  project.  Having  utilized  cen¬ 
tralized  data  sources  to  their  fullest  extent,  data  gaps  have  been  identi¬ 
fied  and  used  to  set  priorities  for  locating  and  incorporating  information 
from  relatively  diffuse  sources,  such  as  specific  project  design  memoranda 
and  other  published  or  unpublished  reports  dealing  with  individual  projects. 

This  stage-wise  data- gathering  procedure  has  been  designed  with  efficiency 
and  cost-effectiveness  in  mind. 

3.  Another  basic  strategy  which  has  been  employed  in  compiling 
water  quality  and  hydrologic  data  has  been  to  assemble  individual  obser¬ 
vations  in  space  and  time  (i.e.,  "raw  data"),  rather  than  average  values. 

This  strategy  accomplishes  the  following: 

i 


a.  It  provides  for  the  broadest  possible  range  of  future 
uses  of  the  data  base. 

b.  It  eliminates  possible  variations  due  to  the  use  of  dif¬ 
ferent  averaging  procedures. 

£.  It  provides  a  basis  for  error  analysis  and  assessment  of 
data  adequacy  in  future  model  testing. 

These  advantages  must  be  weighed  against  the  major  disadvantage  of  the 

approach — it  involves  management  of  a  large  amount  of  information.  The 

water  quality  file  presently  contains  two  million  observations  taken  at 

4451  stations  located  in  or  around  271*  CE  projects. 

4.  Management  of  these  types  and  quantities  of  information  entails 
use  of  a  consistent  framework.  One  basic  strategy  has  been  to  tag  each 
bit  of  information  with  district,  project,  and  data  source  codes.  While 
the  validity  of  the  information  at  its  original  source  cannot  be  sub¬ 
stantiated,  use  of  a  systematic  approach  in  building  the  data  base  insures 
that  the  data  are  transferred  and  accessed  properly.  Keeping  track  of 
original  data  sources  provides  a  means  of  checking  any  piece  of  informa¬ 
tion  at  its  source  and  identifying  discrepancies  among  multiple  data 
sources  for  the  same  value.  The  latter  provides  one  indication  of  data 
and  source  reliability.  Another  validity  test  involves  checking 

for  internal  consistency  in  a  given  set  of  values.  For  example,  the 
morphometric  profiles  have  been  tested  by  comparing  reported  volumes  at 
any  elevation  with  the  integral  of  reported  areas  with  respect  to  depth. 
The  third  validity  test  involves  distribution  of  portions  of  the  data 
base  to  district  offices  for  verification  and  editing.  This  entails 
their  cooperation  and  assumes  that  district-level  sources  of  information 
for  specific  projects  are  the  most  accurate.  This  approach  has  been 
taken  for  upgrading  the  morphometric  data  file  with  reasonable  success. 

5.  In  its  current  state,  the  data  base  is  a  collection  of  infor¬ 
mation  in  a  well-defined  framework.  It  is  not  a  user-oriented  system 
designed  for  frequent  interactive  use.  Such  a  system  would  require 


*  A  total  of  299  projects  are  included  in  the  data  base;  no  water  quality 
data  have  been  located  for  28. 
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extensive  software  development  geared  to  a  specific  computer  system  and 
to  the  intended  uses  of  the  information  in  various  areas  of  reservoir 
management.  The  scope  of  this  project  has  been  limited  to  compiling 
the  information,  organizing  it,  and  extracting  portions  that  are  directly 
relevant  to  analysis  of  eutrophication  problems  in  reservoirs.  With 
additional  software  development  and  systems  programming,  the  data  base 
could  be  made  accessible  for  more  generalized  purposes. 

6.  The  complete  data  base  consists  of  a  collection  of  computer 
files,  reports,  data  forms,  and  maps.  As  discussed  above,  each  piece 
of  information  is  referenced  by  CE  district,  project,  and  data  source 
codes.  Part  II  of  this  report  describes  the  facilities,  methods,  and 
agency  contacts  used  in  this  work.  Part  III  describes  the  general 
structure  of  the  data  base.  Parts  IV  through  XI  document  the  sources 
and  approaches  used  in  compiling  each  element  and  present  data  inven¬ 
tories.  Parts  XII-XIV  summarize  results  of  specific  analyses  which  lay 
the  groundwork  for  use  of  the  data  base  in  Phase  II  of  this  project. 

These  analyses  cover  the  following  topics:  (XII)  numerical  characteri¬ 
zation  of  reservoir  hypsographic  curves;  (XIII)  assessment  of  the 
variability  of  trophic  state  indicators  in  reservoirs;  and  (XIV)  testing 
of  methods  for  estimation  of  nutrient  budgets.  Conclusions  and  Recom¬ 
mendations  are  given  in  Part  XV  and  XVI,  respectively.  Appendix  A 
contains  data  inventories  by  project  and  district. 


PART  II:  FACILITIES  AND  METHODS 


7.  Compilation  and  manipulation  of  the  various  data  files  docu¬ 
mented  in  this  report  have  been  done  on  an  IBM  370-168  computer  maintained 
by  the  Information  Processing  Center  of  the  Massachusetts  Institute  of 
Technology  (MIT) .  This  facility  has  been  used  in  a  batch  processing  mode 

( OS/VS 1)  and  in  an  interactive  mode  through  IBM's  Conversational  Monitor¬ 
ing  System  (CMS) .  Three  media  have  been  used  for  data  storage  where 
appropriate:  (1)  9-track  tapes  (6250  bytes  per  inch);  (2)  3350  disc 

packs  (OS  and  CMS);  and  (3)  cards.  Copies  of  the  current  versions  of 
all  files  have  been  transferred  to  tapes  for  secure  storage  and  future 
access. 

8.  While  most  of  the  information  used  to  assemble  the  data  base  has 
been  read  from  tapes  supplied  by  various  agencies,  some  files  (in  par¬ 
ticular,  the  project  lists,  morphometry,  and  sedimentation  files)  have 
been  assembled  from  tabulated  data.  In  these  cases,  cards  have  been 

used  for  data  entry.  Keypunching  has  been  done  and  verified  using  con¬ 
tract  services  offered  by  MIT. 

9.  Programming  for  data  manipulation  and  analysis  is  in  the  PL/I 

and  FORTRAN  IV  languages.  The  Biomedical  Computer  Program  package 
1  2 

(BMDP)  and  SAS  have  also  been  used  in  preliminary  data  analyses.  Plots 
have  been  produced  with  a  Calcomp  line  plotter. 

10.  Access  to  the  Environmental  Protection  Agency  (EPA)  STORET 
system3  has  been  acquired  through  the  cooperation  of  the  Water  Quality 
Laboratory  of  the  New  England  Division  of  the  Corps  of  Engineers.  The 
staff  of  the  Systems  Analysis  Branch  of  the  EPA  Region  I  Office  in  Boston 
has  been  helpful  in  submitting  STORET  retrievals.  The  identification  of 
water  quality  and  quantity  monitoring  stations  has  been  done  partially 
using  the  services  of  the  National  Water  Data  Exchange  of  the  U.  S. 
Geological  Survey  in  Reston,  Virginia.  The  Corvallis  Environmental 
Research  Laboratory  of  the  U.  S.  Environmental  Protection  Agency  has 
provided  reports  and  data  files  from  the  National  Eutrophication  Survey 

4 

(NES)  .  Sedimentation  survey  sheets  have  been  obtained  through  the 
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Sedimentation  Laboratory  of  the  U.  S.  Department  of  Agriculture  and  the 
South  Technical  Service  Center  of  the  U.  S.  Soil  Conservation  Service 
in  Fort  Worth,  Texas. 

11.  Staff  members  of  the  Environmental  Laboratory  of  the  U.  S. 

Army  Engineer  Waterways  Experiment  Station  (WES)  have  provided  assistance 
in  extracting  and  coding  morphometric  and  drainage  area  data  from  project 
design  memoranda  and  in  coding  water  quality  data  complied  outside  of 
STORET.  The  Ohio  River  Division  (ORD)  of  the  Corps  of  Engineers  provided 
tapes  containing  water  quality  data  gathered  by  district  monitoring 
programs  in  that  division. 


PART  III:  DATA  BASE  STRUCTURE 


12.  Figure  1  depicts  the  organization  of  the  data  base  into 

eight  major  file  groups: 

CODES  -  Data  Base  Codes 

LISTS  -  Project  Lists 

WATS  -  Watershed  Characteristics 

RESER  -  Reservoir  Characteristics 

HYDRO  -  Hydrology  Data 

WQ  -  Water  Quality  Data 

SED  -  Sedimentation  Data 

NES  -  EPA  National  Eutrophication  Survey  Data 

Each  group  contains  a  number  of  computer  files,  data  forms,  and/or  maps. 
File  names  are  in  two  parts.  The  first  refers  to  the  major  file  group 
and  the  second,  to  the  specific  file  within  that  group.  For  example, 
the  water  quality  station  key  is  given  the  name  WQ.KEY.  A  lowercase 
second  name  indicates  that  the  element  is  a  map  or  data  form  (not  a 
computer  file) . 

13.  U.  S.  customary  units  have  been  used  most  extensively  in  the 
files.  This  has  facilitated  the  transfer  and  verification  of  information, 
since  most  of  the  original  sources  of  morphometric,  drainage  are*,  hydro- 
logic,  and  sedimentation  data  were  also  in  U.  S.  customary  units.  Or.e 
exception  to  this  convention  is  the  EPA  National  Eutrophication  Survey 

4 

Compendium  file,  which  was  supplied  by  the  EPA  in  metric  units. 

14.  The  development,  structure,  and  contents  of  each  file  group 
are  discussed  in  the  following  sections.  The  sources  and  approaches 
used  in  compiling  the  information  are  described.  Each  file  is  character¬ 
ized  with  respect  to  format  and  content.  Since  most  files  are  too  large 
for  listing,  data  holdings  are  summarized  in  an  inventory  format,  with 
categories  defined  by  file,  variable,  and  CE  division.  Data  inventories 
by  project  and  district  are  included  in  Appendix  A.  Record  formats  for 
the  files  described  in  this  report  are  defined  as  PL/I  data  structures. 


PART  IV:  CODES  -  DATA  BASE  CODES 

15.  CODES  consists  of  a  group  of  files  which  define  referencing 
systems  used  in  various  portions  of  the  data  base.  These  include  the 
following: 

CODES. DIS  CE  District  Codes 

CODES. DIV  CE  Division  Codes 

CODES. SOURCE  Data  Source  Codes 

CODES. TYPE  Station  Type  Codes 

CODES. POOL  Pool  Codes 

CODES. OUTLET  Outlet  Codes 

CODES. PARAM  Parameter  Codes 

Listings  of  these  files  are  given  in  Tables  1  through  5. 

16.  District  and  division  codes  (Table  1)  provide  a  numerical 
indexing  system  for  each  of  36  districts  and  10  divisions,  respectively. 
Districts  are  grouped  within  divisions.  The  New  England  Division  is 
unique  in  that  it  is  not  comprised  of  districts.  In  order  to  permit 
referencing  of  all  projects  at  the  district  level,  district  number  one 
has  been  defined  to  represent  the  New  England  Division. 

17.  Data  source  codes  (Table  2)  provide  a  referencing  system  for 
nine  data  sources  which  are  used  frequently  in  the  data  base.  Identi¬ 
fying  each  data  entry  by  source  provides  a  basis  for  validation  and 
sorting  out  discrepancies  among  multiple  data  sources  for  a  given  project 
and  characteristic. 

18.  A  total  of  nine  station  type  codes  have  been  defined  for  use 
in  the  water  quality  and  hydrology  files  (Table  3) .  These  provide  a 
frame  of  reference  for  locating  monitoring  stations  within  a  given 
project.  Broadly,  these  permit  distinction  among  stations  located  on 
upstream  tributaries,  within  reservoir  pools,  and  in  or  below  reservoir 
discharge  streams.  Within-pool  stations  are  further  classified  as 
upper-pool,  mid-pool,  or  near-dam.  Mid-pool  is  used  as  a  default  for 
lake  stations.  The  remaining  two  are  used  in  cases  where  coordinates, 
maps,  and/or  station  location  descriptions  provide  an  adequate  basis 
for  more  refined  classification.  Secondary  tributary  codes  (upstream 
and  downstream  from  impoundments)  have  been  used  only  for  some  EPA 
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National  Eutrophication  Survey  stations  to  aid  in  hydrologic  budget 
computations. 

19.  Pool  and  outlet  codes  (Table  4)  are  used  in  the  morphometric 
file.  These  provide  systems  for  referencing  various  elevations  to  pool 
allocations  for  specific  uses,  ranges  of  operating  levels,  and  locations 
and  types  of  principal  outlets.  The  systems  were  initially  designed 

at  WES.  Additional  codes  have  been  added  as  needed  during  subsequent 
morphometric  data  compilation. 

20.  The  parameter  codes  file  (Table  5)*  is  used  to  reference 
hydrologic  and  water  quality  data.  The  file  contains  94  members,  each 
identified  by  a  water  quality  parameter  code,  STORET  code"^,  measurement 
type,  and  units.  The  5-digit  STORET  code  is  used  in  retrieving  water 
quality  and  hydrologic  data  from  the  STORET  system.  It  is  also  used 

to  identify  measurements  in  the  hydrology  files.  In  addition  to  the 
89  basic  water  quality  parameter  codes  included  in  the  file,  there  are 
11  redundant  parameter  codes,  which  have  been  used  in  retrieving  water 
quality  data  from  STORET.  Redundant  codes  result  from  multiple  means 
of  expressing  a  given  type  of  observation  (e.g. ,  temperature  in  degrees 
C  or  degrees  P  or  total  phosphorus  as  P  or  as  P04) .  Redundancies  have 
been  eliminated  in  final  data  storage  by  applying  appropriate  conversion 
factors  in  each  case. 
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Table  1 


District  and  Division  Codes 


Code 

District 

Div 

Code 

Division 

01 

Mew  England 

01 

01 

New  England 

02 

New  York 

02 

02 

North  Atlantic 

03 

Philadelphia 

02 

03 

South  Atlantic 

04 

Baltimore 

02 

04 

Ohio  River 

05 

Norfolk 

02 

05 

North  Central 

06 

Wilmington 

03 

06 

Lower  Mississippi  Valley 

07 

Charleston 

03 

07 

South  West 

08 

Savannah 

03 

08 

Missouri  River 

09 

Jacksonville 

03 

09 

North  Pacific 

10 

Mobile 

03 

10 

South  Pacific 

11 

Buffalo 

05 

12 

Detroit 

05 

13 

Chicago 

05 

14 

Rock  Island 

05 

15 

St.  Paul 

05 

16 

Pittsburgh 

04 

17 

Huntington 

04 

18 

Louisville 

04 

19 

Nashville 

04 

20 

St.  Louis 

06 

21 

Memphis 

06 

22 

Vicksburg 

06 

23 

New  Orleans 

06 

24 

Little  Rock 

07 

25 

Tulsa 

07 

26 

Fort  Worth 

07 

27 

Galveston 

07 

28 

Albuquerque 

07 

29 

Kansas  City 

08 

30 

Omaha 

08 

31 

Walla  Walla 

09 

32 

Seattle 

09 

33 

Portland 

09 

34 

Sacramento 

10 

35 

San  Francisco 

10 

36 

Los  Angeles 

10 

Table  2 


Data  Source  Codes 


_ Data  Source _ 

Leidy  and  Jenkins 

EPA  National  Eutrophication  Survey 
District  or  Division 
Sedimentation  Survey  Sheets 
Design  Memoranda 

USGS  State  Water  Resources  Data  Reports 
USGS/WATSTORE  File 
EPA  STORET 

INFONET  -  Ohio  River  Division 


Table  3 

Station  Type  Codes 


Code 

Station  Type 

01 

Tributary 

02 

Pool 

03 

Discharge 

04 

Pool  (nr.  dam) 

05 

Pool  (headwaters) 

06 

Unused 

07 

Point  source 

08 

Sec.  trib.  (downstr. 

09 

Sec.  trib.  (upstr.) 

Flood  control 
Conservation 
Water  quality 
Minimum 
Summer 

Winter 
Water  supply 
Power 
Recreation 
Dead  storage 

Multiple  use 
Stream  bed 
Top  of  dam 

Period  of  record  minimum 
Period  of  record  maximum 

Normal 

Maximum  power 
Minimum  power 
Sediment 

Maximum  regulated 


Outlet 


WLfi 


>e 


Table  5 


f 


Parameter  Codes 


STORET 

HYDRO 

w Q  Component 

Units 

00027 

00028 

72025 

01  Code  for  agency  collecting  sample 

02  Code  for  agency  analyzing  sample 

03  Depth  of  pond  or  reservoir 

feet 

00068 

04  Maximum  sample  depth 

feet 

72020 

72020 

05  Elevation 

ft  >  msl 

00062 

06  Elevation,  reservoir  surface 

ft 

72030 

07  Elevation  of  reservoir  pool 

ft  >  msl 

00054 

00054 

08  Reservoir  storage 

acre-ft 

72033 

09  Flow,  average  daily,  spillway 

cfs 

72034 

10  Flow,  instantaneous,  spillway 

cfs 

00061 

11  Stream  flow,  instantaneous 

cfs 

00060 

00060 

12  Stream  flow,  daily 

cfs 

00065 

00065 

13  Stream  stage 

feet 

00010 

00010 

14  Temp 

deg-C 

00011 

*14  Temp 

deg-F 

00300 

00300 

15  02  Dissolved 

mg/1 

00299 

16  02  Dissolved,  electrode 

mg/1 

00090 

17  Oxidation  reduction  potential 

mv 

00094 

18  Specific  conductivity,  field 

umhos/cm 

00095 

00095 

19  Specific  conductivity,  lab 

umhos/cm 

00400 

00400 

20  pH  (field) 

su 

00403 

21  pH  (lab) 

su 

00410 

22  Alkalinity,  total  as  CaCo3 

mg/1 

00435 

23  Acidity,  total  as  CaCo3 

mg/1 

00900 

24  Hardness,  total  as  CaCo3 

mg/1 

00940 

00940 

25  Chloride 

mg/1 

00945 

00945 

26  Sulfate  total 

mg/1 

01045 

27  Iron,  total  as  Fe 

ug/1 

71885 

*27  Iron  total  as  Fe 

mmg/1 

01046 

28  Iron,  dissolved 

ug/1 

01055 

29  Manganese,  total  as  Mn 

ug/1 

71883 

*29  Manganese  total  as  Mn 

mmg/1 

01056 

30  Manganese,  dissolved 

ug/1 

(Continued) 


*  Redundant  water  quality  parameter  code. 
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Table  5  (Continued) 


STORET 

HYDRO 

WQ  Component 

Units 

00916 

31 

Calcium,  total 

mg/1 

00915 

32 

Calcium,  dissolved 

mg/1 

00927 

33 

Magnesium,  total 

mg/1 

00925 

34 

Magnesium,  dissolved 

mg/1 

00929 

35 

Sodium,  total 

mg/1 

00930 

36 

Sodium,  dissolved 

mg/1 

00937 

37 

Potassium,  total 

mg/1 

00935 

38 

Potassium,  dissolved 

mg/1 

00070 

00070 

39 

Turbidity 

jtu 

00074 

40 

Turbidity,  transmissometer ,  percent 

transmission 

percent 

00076 

41 

Turbidity,  Hach  turbidometer 

ftu 

00078 

42 

Transparency,  Secchi 

m 

00077 

*42 

Transparency,  Secchi 

in 

00031 

43 

Light,  percent  remaining  at  given  depth 

percent 

00034 

44 

Depth  at  which  1  percent  of  surface 

light  rem 

ft 

00080 

45 

Color,  true 

pt-co  units 

00081 

46 

Color,  apparent 

pt-co  units 

00955 

47 

Silica,  dissolved 

mg/1 

00956 

48 

Silica,  total 

mg/1 

00310 

49 

B0D5 

mg/1 

00405 

50 

Carbon  dioxide 

mg/1 

00680 

51 

Carbon  total  organic 

mg/1 

00681 

52 

Carbon  dissolved  organic 

mg/1 

00685 

53 

Carbon  total  inorganic 

mg/1 

00691 

54 

Carbon,  dissolved  inorganic 

mg/1 

00665 

55 

Phosphorus,  total  as  P 

mg/1 

00650 

*55 

Phosphate,  total  as  P04 

mg/1 

71886 

*55 

Phosphorus  total  as  P04 

mg/1 

00666 

56 

Phosphorus,  dissolved  as  P 

mg/1 

00669 

57 

Phosphorus  total  hydrolyzable  as  P 

mg/1 

00678 

58 

Phosphorus,  hydrol  +  ortho,  total. 

autoanal 

mg/1 

00671 

59 

Phosphorus,  dissolved  ortho  as  P 

mg/1 

00660 

*59 

Phosphate,  ortho  as  P04 

mg/1 

70507 

60 

Phosphorus,  inorganic  total  ortho  as  P 

mg/1 

(Continued) 


*  Redundant  water  quality  parameter  code. 
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Table  5  (Continued) 


STORET 

HYDRO 

WQ  Component 

Units 

00600 

61 

Total  N 

mg/1 

71887 

*61 

Nitrogen  total  as  N03 

mg/1 

00605 

62 

Organic  N 

mg/1 

00610 

63 

Ammonia  N 

mg/1 

71845 

*63 

Ammonia,  total  as  NH4 

mg/1 

00625 

64 

Total  Kjeldahl  N 

mg/1 

00630 

65 

N02  +  N03-N 

mg/1 

00615 

66 

N02-N 

mg/1 

71855 

*66 

Nitrite  total  as  N02 

mg/1 

00613 

67 

Nitrite  nitrogen  dissolved  as  N 

mg/1 

71856 

*67 

Nitrite  dissolved  as  N02 

mg/1 

00620 

68 

N03-N 

mg/1 

71850 

*68 

Nitrate  N  as  N03 

mg/1 

00618 

69 

Nitrate  nitrogen  dissolved  as  N 

mg/1 

71851 

*69 

Nitrate  N  dissolved  as  N03 

mg/1 

00500 

70 

Residue,  total 

mg/1 

00505 

71 

Residue,  total  volatile 

mg/1 

00515 

72 

Residue,  total  filtrable  dried  at 

105  deg  C 

mg/1 

00530 

73 

Residue  total  non-f iltrable  dried  at 

105  deg  C 

mg/1 

80154 

80154 

74 

Suspended  sediment  cone  -  evap  at 

110  deg  C 

mg/1 

70300 

70300 

75 

Residue  total  filtrable  at  180  deg  C 

mg/1 

32209 

76 

Chlorophyll-A  f luorometric ,  corrected 

ug/1 

32217 

77 

Chlorophyll-A  f luorometric ,  uncorrected 

ug/1 

32211 

78 

Chlorophyll-A  trichromatic,  corrected 

ug/1 

32210 

79 

Chlorophyll-A  trichromatic,  uncorrected 

ug/1 

32230 

80 

Ch lorophy 1 1- A 

mg/1 

60050 

81 

Algae,  total 

cells/ml 

00570 

32 

Biomass,  plankton 

ml/1 

85209 

83 

Algal  growth  potential 

mg/1 

60990 

84 

Zooplankton,  other 

no/liter 

31616 

85 

Fecal  coliform,  memb  filter,  m-fc  broth 

44.5  deg 

no/100  ml 

31673 

86 

Fecal  streptococci,  memb  filter,  kf 

agar,  35  deg 

no/100  ml 

31679 

87 

Fecal  strep  mf  m-ent 

no/100  ml 

(Continued) 


*  Redundant  water  quality  parameter  code. 
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Table  5  (Concluded) 


STORET 

HYDRO 

W Q  Component 

Units 

50051 

88  Flow  rate  instantaneous 

mgd 

50053 

89  Conduit  flow  -  monthly 

mgd 

70301 

70301 

Dissolved  solids  -  sum  of  constituents 

mg/1 

80155 

80155 

Sediment  discharge 

tons/day 

70291 

70291 

Dissolved  sulfate  discharge 

tons /day 

70290 

70290 

Dissolved  chloride  discharge 

tons/day 

70302 

70302 

Dissolved  solids  discharge 

tons/day 

(Sheet  4  of  4) 


PART  V:  LISTS  -  PROJECT  LISTS 


21.  LISTS,  the  second  major  file  group,  defines  the  referencing 
system  used  for  CE  projects  in  the  data  base.  It  consists  of  the  fol¬ 
lowing  files: 

LISTS. CPL  -  Central  Project  List 
LISTS. DPL  -  Deleted  Project  List 

The  development  and  contents  of  these  files  are  discussed  below. 

22.  The  data  base  is  built  around  a  central  list  of  299  reservoirs 
which  have  been  identified  from  various  sources  and  placed  in  the  LISTS .CPL 
file.  The  regional  distribution  of  these  projects  is  shown  in  Figure  2. 

Breakdowns  by  CE  district  and  division  are  given  in  Table  6.  The  follow¬ 
ing  have  been  used  as  criteria  for  inclusion: 

a.  projects  currently  operated  by  the  Corps  of  Engineers. 

b.  projects  having  seasonal  or  permanent  pools. 

The  second  criterion  has  been  applied  to  eliminate  locks  and  small  run- 
of-the-river  impoundments  with  short  hydraulic  residence  times  and  little 
opportunity  for  inducing  water  quality  changes,  at  least  with  respect  to 
eutrophication . 

23.  The  two  primary  sources  of  information  used  to  develop  an 

initial  project  list  include  a  tabulation  of  CE  projects  with  surface 

areas  greater  than  500  acres  compiled  by  Leidy  and  Jenkins5  and  a  map 

0 

of  CE  water  resource  projects  .  Based  upon  CE  Water  Resource  Development 

7 

Reports  and  information  supplied  by  various  CE  district  offices,  the 
initial  list  has  been  screened  to  eliminate  projects  which  are  incom¬ 
plete,  not  currently  under  CE  control,  and/or  do  not  have  appreciable 
pools.  A  separate  list  of  impoundments  which  have  been  eliminated  has 
been  maintained  for  future  reference  (LISTS. DPL).  Because  it  has  not 
been  feasible  within  the  scope  of  this  project  to  compile  and  incorporate 
data  from  detailed,  project-specific  reports,  the  current  project  list 
may  contain  some  impoundments  which  do  not  conform  to  the  above  criteria. 

Similarly,  some  projects  may  have  been  missed.  Inclusion  and/or  screening 

i  i 


of  additional  projects  would  be  possible  with  more  time  devoted  to  com¬ 
piling  and  examining  detailed  reports. 

24.  The  record  format  used  in  the  LISTS. CPL  and  LISTS. DPL  files 
is  given  in  Table  7.  Piles  are  listed  in  Tables  8  and  9,  respectively. 
Each  project  has  been  assigned  a  unique,  three-digit  identification 
code  to  facilitate  referencing  in  the  data  base.  The  location  of  each 
project  is  identified  by  CE  division,  district,  state,  county,  latitude, 
longitude,  and  hydrologic  unit.  Hydrologic  unit  maps  compiled  by  the 

Q 

U.  S.  Geological  Survey  (USGS)  have  been  used  to  provide  basic  location 
data.  Reservoirs  lying  on  the  boundaries  of  states,  counties,  and/or 
hydrologic  units  have  been  referenced  based  upon  dam  location.  State 
and  county  codes  refer  to  the  standard  federal  coding  system  (FIPS) 
documented  in  the  EPA's  STORET5  user's  manual.  The  latitudes  and  longi¬ 
tudes  of  projects  in  which  surface  elevation  monitoring  stations  have 
been  located  refer  to  those  stations,  which  occur  most  frequently  at  dam 
sites.  In  other  situations,  coordinates  have  been  approximated  from 
hydrologic  unit  maps  and  refer  roughly  to  dam  locations. 

25.  As  shown  in  Table  1,  the  project  list  is  cross-referenced  to 
three  independent  data  bases: 

9 

a.  the  EPA  National  Eutrophication  Survey  Working  Papers  . 

b.  the  U.  S.  Department  of  Agriculture  (USDA)  compilation  of 
reservoir  sedimentation  data-*-®. 

c.  the  CE  project  file  compiled  by  Leidy  and  Jenkins5. 

The  cross-referencing  system  facilitates  access  to  specific  information 
on  projects  contained  in  these  sources. 
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PART  VI:  WATS  -  WATERSHED  CHARACTERISTICS 


26.  WATS,  the  third  major  file  group,  contains  information  on 

project  watersheds.  It  consists  of  the  following  three  elements: 

WATS. maps  -  Watershed  Maps 

WATS. POLYS  -  Watershed  Polygon  Coordinates 

WATS . DAREAS  -  Drainage  Area  Characteristics 

This  information  supplements  the  location  descriptors  contained  in  the 

LISTS  files.  Each  element  is  described  below. 

27.  A  set  of  watershed  maps  has  been  compiled  from  the  USGS  hydro- 

8  9 

logic  unit  maps  ,  EPA  National  Eutrophication  Survey  Working  Papers  , 

and  a  report  on  CE  projects  in  the  New  England  Division11.  The  maps  are 
labeled  with  descriptive  data  contained  in  the  LISTS. CPL  file  and  stored 
in  a  loose-leaf  notebook.  Hydrologic  unit  maps,  used  most  extensively, 
are  on  a  scale  of  1:500,000.  An  example  is  given  in  Figure  3.  In  some 
cases,  projects  have  been  built  after  map  publication  and  only  the  water¬ 
sheds  are  shown. 

28.  WATS. POLYS  contains  the  latitude/ longitude  coordinates  of 
polygons  which  contain  projects  and  their  watersheds.  These  coordinates 
have  been  used  to  identify  water  quality  monitoring  stations  in  STORET 
(see  Part  IX) .  Each  record  contains  up  to  five  coordinate  pairs  and  is 
referenced  by  district  and  project  codes.  Coordinates  have  been  estimated 

from  watershed  maps  contained  in  EPA  National  Eutrophication  Survey 

9 

reports  .  The  108  projects  which  were  sampled  under  that  program  are 
represented  in  WATS. POLYS  file. 

29.  WATS. DAREAS  contains  additional  descriptive  information  on 
project  watersheds  in  the  format  given  in  Table  10.  Each  record  is 
referenced  by  district,  project,  and  data  source  codes.  Table  11  lists 
the  elements  of  the  file  along  with  corresponding  data  sources.  Some 
discrepancies  among  multiple  data  sources  for  the  same  project  and 
characteristic  remain  in  the  file,  particularly  in  drainage  areas  and 
mean  flows.  These  could  be  resolved  through  verification  of  the  file 
at  the  district  level.  An  inventory  of  the  file  contents  by  division 
is  given  in  Table  12.  Inventories  by  project  and  district  are  given  in 
Appendix  A. 
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Figure  3 


Sample  Watershed  Map 
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Sources  of  Data  for  the  WATS . DAREAS  File  by  Component 
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Totals  299  295  264  184  282  269  80  67  259  257 


PART  VII:  RESER  -  RESERVOIR  CHARACTERISTICS 

30.  The  fourth  major  file  grouping,  RESER,  contains  detailed 
information  on  reservoir  characteristics.  It  consists  of  the  following 
four  elements: 

RESER. MORPHO  -  Project  Morphometry 
RESER. desc  -  Verbal  Project  Descriptions 
RESER. broch  -  CE  Recreational  Brochures 
RESER.COM  -  Comments 

Each  of  these  elements  is  described  below. 

31.  The  fluctuating  pool  levels  characteristic  of  many  reservoirs 
necessitates  the  compilation  of  morphometric  data  which  are  referenced 
to  pool  elevations.  Volume  and  surface  area  variations  with  elevation 
are  required  for  estimation  of  volume-averaged  water  quality  conditions, 
given  measurem  nts  made  at  specific  depths.  Seasonal  variations  in 
reservoir  volume  and  discharge  induce  variations  in  mean  depth  and 
hydraulic  residence  time  which  may,  in  turn,  influence  the  response  of 
trophic  state  indicators  to  nutrient  loading.  Thus,  detailed  morphometric 
information  is  an  essential  component  of  the  data  base  for  eutrophication 
modelling  and  for  other  general  uses. 

32.  The  record  format  and  contents  of  the  RESER. MORPHO  file  are 
described  in  Table  13.  Each  record  is  referenced  by  district,  project, 
elevation,  and  data  source  code.  In  addition  to  area,  volume,  length, 
width,  and  shoreline  length  data,  the  file  contains  a  series  of  pool  and 
outlet  codes,  as  listed  in  Table  4.  These  codes  provide  supplementary 
descriptive  information  on  pool  allocations  for  various  uses,  ranges  of 
operating  levels,  and  locations  and  types  of  principal  outlets. 

33.  The  RESER. MORPHO  file  was  initially  based  upon  data  extracted 
from  project  design  memoranda.  The  other  principal  data  sources  include: 

(1)  a  report  by  Leidy  and  Jenkins5;  (2)  USGS  water  resources  data  re- 

13  12 

ports,  by  state  and  year  ;  (3)  sedimentation  survey  sheets  ;  and 

(4)  district  and  division  offices.  Information  compiled  from  these 

sources  has  been  coded  and  sorted  by  project  and  elevation.  Initial 

screening  was  done  to  identify  and  correct,  where  possible,  any 
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obvious  errors,  such  as  decreasing  area  or  volume  with  increasing  eleva¬ 
tion  within  a  given  project.  There  were  sufficient  inconsistencies  among 
the  various  data  sources  for  many  projects  to  warrant  independent  veri¬ 
fication  of  the  file.  Accordingly,  the  file  was  distributed  to  the 
districts  through  WES  and  additions  and  corrections  were  made  based  upon 
district  responses.  Final  editing  was  done  to  eliminate  most  of  the 
redundancies  in  the  file  due  to  multiple  data  sources  for  the  same  project 
and  elevation.  A  current  data  inventory  by  division  is  given  in  Table  14. 
Inventories  by  district  and  project  are  given  in  Appendix  A. 

34.  RESER.desc  consists  of  a  collection  of  verbal  project  descrip¬ 
tions  copied  from  USGS  water  resources  data  reports  published  annually 

by  state^.  The  descriptions  are  referenced  by  district  and  project 
codes  and  assembled  in  a  loose-leaf  notebook.  These  descriptions  summa¬ 
rize  hydrologic  monitoring  activities  by  the  USGS,  along  with  important 
project  characteristics  and  purposes.  The  file  currently  contains 
entries  for  260  out  of  299  projects  in  the  central  project  list. 

35.  RESER.broch  is  a  collection  of  brochures  published  by  CE  dis¬ 
trict  and  division  offices  as  guides  to  recreation  in  specific  projects. 
These  usually  contain  detailed  project  maps  which  are  useful  for  locating 
monitoring  stations.  Project  purposes  and  characteristics  are  also  sum¬ 
marized.  Each  folder  is  referenced  by  district  and  project  number  and 
stored  in  a  hanging  file.  Currently,  the  file  contains  information  from 
eight  districts:  Pittsburg,  Huntington,  Louisville,  Nashville,  Vicksburg, 
Tulsa,  Forth  Worth,  and  Sacramento. 

36.  The  RESER.COM  file  contains  miscellaneous  descriptive  infor¬ 
mation  on  various  projects.  This  file  has  been  designed  to  hold  data  or 
comments  which  do  not  conform  to  other  file  formats.  Each  record  is  80 
characters  long  and  is  referenced  by  district  and  project  codes.  A 
listing  of  the  current  version  of  this  file  is  given  Table  15. 
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Totals  299  296  295  296  288  293  285  246  212  146  227 


Table  15 


Listing  of  the  RESER.COM  File 


01157  Mad  River  dam  now  under  control  of  State  of  Connecticut 
01165  Connected  to  Hopkinton  (01167)  at  high  water 
01167  Connected  to  Everett  (01165)  at  high  water 
07232  Project  transferred  to  Wilmington  District  (06) 

26363  Sediment  survey  refers  to  Old  Waco 
28408  No  permanent  pool 

30214  2  Separate  lakes  (W.  twin/E.  twin)  below  elevation  1342 

30333  Project  never  reached  full  pool 

33298  Re-regulating  dam  for  Green  Peter  (33299) 

33305  Re-regulating  dam  for  Detroit  (33293) 

34048  Sediment  survey  refers  to  Old  Don  Pedro 
34051  Sediment  survey  refers  to  Old  Exchequer 


PART  VIII:  HYDRO  -  HYDROLOGY  FILES 


37.  The  fifth  file  group,  HYDRO,  contains  detailed  hydrologic 

data,  organized  in  the  following  files: 

HYDRO. KEY  -  Station  Key 

HYDRO. DAILY  -  Daily  Values 

HYDRO . MONTHLY  -  Monthly  Summaries 
HYDRO. YEARLY  -  Yearly  Summaries 

HYDRO. SUM  -  Grand  Summaries 

This  information  has  been  compiled  to  provide  bases  for  nutrient  budget 
calculations,  estimating  pool  hydraulic  residence  times  (as  influenced 
by  reservoir  elevation  and  discharge) ,  and  depth-averaging  of  water 
quality  observations  (as  influenced  by  reservoir  morphometry  and  pool 
level) .  Because  of  the  stringent  water  quality  sampling  requirements 
for  estimation  of  nutrient  budgets,  streamflow  data  required  for  such 
calculations  have  been  compiled  only  for  those  projects  and  years 
sampled  by  the  EPA  National  Eutrophication  Survey.  Attempts  have  been 
made,  however,  to  compile  reservoir  discharge  and  elevation/contents 
data  from  all  projects  for  1965  to  date  using  three  data  sources :  USGS/ 

14  ^5 

WATSTORE  ,  the  EPA  National  Eutrophication  Survey  ,  and  sedimentation 
12 

survey  sheets  .  The  HYDRO. KEY  file  describes  1307  stations  from  all 
three  data  sources  in  the  format  depicted  in  Table  16. 

38.  The  first  data  source  includes  USGS  stations  monitoring 
reservoir  elevation/contents  or  streamflow  at  or  below  reservoir  dis¬ 
charge  points.  These  stations  have  been  identified  using  the  Master 

Water  Data  Index  of  the  USGS  National  Water  Data  Exchange1*’  and  USGS 

13 

water  resources  data  reports  by  state  and  year  .  Daily  values  have 
been  retrieved  through  STORET^  for  the  period  from  1965  to  the  most 
recent  available  as  of  February,  1980.  Only  those  stations  with  daily 
values  entered  in  WATSTORE  are  included. 

4 

39.  The  EPA  National  Eutrophication  Survey  ,  which  sampled  108  CE 
projects,  assembled  a  hydrologic  data  base  coirpatible  with  its  water 
quality  sampling  network  for  use  in  nutrient  budget  computations.  It 
includes  streamflow  estimates  for  upstream  and  downstream  tributaries. 
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For  each  station,  flows  are  estimated  on  three  time  scales:  daily  (only 
for  the  days  on  which  water  quality  samples  were  taken) ,  monthly  (only 
for  the  months  in  which  water  quality  samples  were  taken) ,  and  normalized 
monthly  (normal  flow  for  each  month) .  This  information  has  been  retrieved 
from  a  tape  provided  by  the  EPA  Corvallis  laboratory15.  Monthly  and 
normalized  monthly  flows  have  been  stored  in  the  HYDRO. MONTHLY  file. 
Because  they  only  refer  to  water  quality  sampling  dates,  daily  flows 
have  been  stored  along  with  the  water  quality  data  in  the  WQ.OBS  file. 

40.  The  third  source  of  hydrologic  data,  sedimentation  survey 

12 

sheets  ,  has  provided  annual  estimates  of  reservoir  total  inflow,  mini¬ 
mum  elevation,  and  maximum  elevation,  typically  for  10  water  years  in 
most  of  the  84  projects  for  which  sedimentation  survey  sheets  have  been 
located.  This  information  has  been  stored  in  the  HYDRO. YEARLY  file. 

41.  The  formats  of  the  DAILY,  MONTHLY,  YEARLY,  and  SUM  hydrology 
files  are  listed  in  Tables  17  through  20,  respectively.  HYDRO. DAILY, 
which  contains  data  from  USGS/WATSTORE  stations  only,  is  in  the  WATSTORE 
format.  It  is  linked  to  the  project  list  through  the  sequence  number 
stored  in  the  HYDRO. KEY  file.  The  other  hydrology  files  contain  direct 
references  to  districts  and  projects.  In  retrieving  daily  values,  all 
parameter  codes  recorded  at  each  station  were  included.  Thus,  the  daily 
file,  and  the  monthly,  yearly,  and  grand  summaries  generated  from  it, 
contain  some  water  quality  information  monitored  by  the  USGS  on  a  daily 
basis  (e.g.,  temperature,  conductivity,  suspended  solids).  Parameter 
codes  and  coverage  are  indicated  in  Table  5. 

42.  Monthly  variations  in  reservoir  discharge  and  elevation  are 
needed  in  order  to  provide  bases  for  calculating  pool  hydraulic  resi¬ 
dence  times  and  volume-averaged  water  quality  conditions  on  a  seasonal 
basis.  Table  21  presents  an  inventory  of  reservoir  discharge,  elevation, 
and  contents  data  monitored  at  USGS  stations  and  contained  in  the 
HYDRO. MONTHLY  file.  Table  21  is  organized  by  division.  Corresponding 
inventories  by  reservoir  and  district  are  contained  in  Appendix  A.  Since 
the  monthly  hydrologic  summary  has  been  generated  from  the  daily  values 
file,  these  inventories  also  reflect  daily  data  holdings. 


43.  The  files  contain  reservoir  discharge  data  for  245  out  of  the 
299  projects  in  the  central  project  list.  Elevation  and  contents  data 
are  included  for  44  and  108  projects,  respectively.  Regional  deficiencies 
in  elevation  or  contents  data  are  particularly  evident  for  the  New  England 
North  Atlantic,  South  Atlantic,  Ohio  River,  and  Missouri  River  Divisions. 
These  deficiencies  need  to  be  corrected,  probably  using  district-level 
information  sources,  in  order  to  provide  a  basis  for  model  evaluations 
under  Phase  II  of  this  project. 
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PART  IX:  WQ  -  WATER  QUALITY  FILES 


Introduction 


44.,  The  sixth  group  of  files,  WQ,  contains  water  quality  infor¬ 
mation  for  CE  projects.,  It  is  organized  as  follows: 

WQ.KEY  -  Station  Key 

WQ.DESTAT  -  Detailed  Station  Descriptions 
WQ.OBS  -  Observations 

WQ.SUM  -  Data  Summary  by  Station  and  Parameter 

WO. maps  -  Station  Maps 

Three  sources  of  water  quality  data  have  been  used:  (1)  EPA's  STORET 

3  17 

system  ;  (2)  the  INFONET  system  used  by  the  Ohio  River  Division  of  the 
Corps  of  Engineers;  and  (3)  miscellaneous  survey  data  for  specific 
projects.  The  numbers  of  stations  and  observations  obtained  from  each 
source  are  listed  in  Table  22.  The  use  of  these  sources  and  resulting 
file  structures  and  contents  are  described  in  the  following  sections. 

STORET  Data  Acquisition  and  Processing 

45.  As  in  Table  22,  STORET  is  the  primary  source  of  water  quality 

information.  The  following  sources  have  been  used  to  identify  station 

codes  and  to  associate  them  with  specific  CE  projects: 

a_.  the  Master  Water  Data  Index  (MWDI)  maintained  by  the 
National  Water  Data  Exchange  of  the  USGS^^. 

b.  the  USGS  Catalogue  of  Information  on  Water  Data^®'^. 

c.  a  list  of  stations  included  in  the  National  Stream 
Quality  Accounting  Network  (NASQAN)  maintained  by  the 
USGS20. 

9 

d.  the  EPA  National  Eutrophication  Survey  Working  Papers  . 
and 

£,  direct  station  identification  retrievals  from  STORET 
using  a  latitude/longitude  search  technique. 

These  sources  are  described  below  in  the  order  used. 


Table  22 


Inventory  of  Water  Quality  Data  by  Source 


Sample 


Agency 

Stations 

Dates 

Observations 

STORET  - 

EPA/NES 

1,637 

16,119 

181,173 

STORET  - 

USGS 

655 

35,326 

592,853 

STORET  - 

CE 

470 

34,890 

322,337 

STORET  - 

States 

541 

18,243 

235,894 

STORET  - 

Other 

357 

8,078 

154,266 

INFONET  ■ 

-  ORD 

763 

16,995 

534,412 

Miscellaneous 

28 

2,259 

TOTAL 

4,451 

129,821 

2,023,194 

60 
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46.  The  Master  Water  Data  Index  (MWDI)  documents  water  quality 
and  quantity  monitoring  activities  by  various  local,  state,  and  federal 
agencies  throughout  the  U.  S.  and  contains  information  on  site  loca¬ 
tion,  agency,  dates,  types  of  measurements,  and  data  storage  media. 

It  does  not  contain  measurements,  but  serves  as  a  means  of  locating 
them.  The  MWDI  registers  all  stations  in  the  EPA  STORET  and  USGS 
WATSTORE  systems,  in  addition  to  information  on  monitoring  activities 
by  agencies  which  do  not  participate  in  other  federal  data  banks. 

Thus,  this  file  represents  the  most  comprehensive  one  available  for 
identifying  monitoring  sites  and  locating  data.  An  initial  list  of 
monitoring  stations  associated  with  specific  projects  was  derived  by 
applying  a  latitude/longitude  search  technique  to  two  large  station 
files  acquired  on  tape  from  the  National  Water  Data  Exchange.  One 
contained  all  stations  in  the  'J.  S.  monitored  by  the  Corps  of  Engineers, 
and  the  other  contained  all  lake  or  reservoir  stations  monitored  by  any 
agency  in  the  U.  S.  The  station/project  matching  derived  from  this 
search  was  verified  manually  by  checking  station  location  descriptions 
and  consulting  maps,  when  needed. 

47.  The  station  listings  derived  from  the  MWDI  did  not  contain 
stations  monitored  by  non-CE  agencies  on  tributary  streams.  The  second 
and  third  sources  listed  above  provided  additional  tributary  stations 
operated  primarily  by  the  USGS.  NASQAN^*"*  stations  are  particularly 
data-rich,  having  been  operated  by  monthly  frequencies  since  1975  with 

a  broad  water  quality  parameter  coverage.  Forty-nine  such  stations  have 
been  located  in  or  directly  below  the  watersheds  of  projects  in  the 
central  project  list* 

48.  The  STORET  station  list  also  includes  all  stations  operated 
by  the  EPA  National  Eutrophication  Survey  in  108  CE  projects.  These 
stations  include  upstream  tributaries,  point  sources,  reservoir  stations, 
and  reservoir  discharge  stations.  Nutrient  loading  calculations  for 
these  projects  will  provide  a  basis  for  the  testing  loading  models  in 
Phase  II. 


49.  Finally,  a  series  of  station  identification  retrievals  were 
done  directly  in  STORET  using  a  search  technique  based  upon  the  latitude/ 
longitude  polygons  contained  in  the  WATS. POLYS  file.  Because  of  cost 
and  time  considerations,  this  technique  was  applied  only  to  projects 
which  were  in  one  of  two  categories:  (1)  sampled  by  the  EPA  National 
Eutrophication  Survey  (since  these  will  be  the  primary  focus  of  Phase  II 
modelling  efforts) ;  or  (2)  without  water  quality  data  derived  from 
other  station  searching  techniques.  An  extracting  option  available 

in  STORET  was  also  used  to  identify  only  stations  for  which  total  phos¬ 
phorus  measurements  were  available. 

50.  Experience  with  these  alternative  station  searching  techniques 
indicates  that  no  one  method  is  completely  satisfactory.  Each  relies  upon 
the  accuracy  of  the  station  characteristics  and  coordinates  entered  in 
the  STORET  file.  Polygon  search  techniques  often  retrieve  extraneous 
stations  or  miss  relevant  stations  because  of  inaccurate  latitude/longi¬ 
tude  entries  in  the  STORET  station  file.  Similarly,  retrievals  which 
depend  upon  station  types  (e.g.,  "stream"  vs.  "lake”)  will  miss  stations 
which  have  been  inaccurately  classified*  For  example,  many  stations 
located  in  reservoir  pools  (based  upon  location  descriptions  and/or 
coordinates)  were  classified  as  "stream"  stations  in  STORET  and  MWDI* 

The  variety  of  methods  employed  to  identify  stations  has  helped  to  pro¬ 
vide  reasonable  project  coverage.  All  station/project  matchings  in  the 
final  STORET  station  list  have  been  checked  manually  with  reference  to 
verbal  station  location  descriptions  and  maps. 

51.  Preliminary  STORET  retrievals  have  been  used  to  screen  out 
sites  with  little  or  no  relevant  information  and  to  verify  station  codes. 
Results  have  been  obtained  in  the  STORET  Inventory  format,  which  lists 
station  descriptions  and  statistical  summaries  of  water  quality  components 
monitored.  Based  upon  these  inventories,  most  stations  with  only  one 
sampling  date  have  been  eliminated  from  the  station  list. 

52.  Following  the  screening  procedure,  a  second  series  of  STORET 
retrievals  has  been  used  to  obtain  copies  of  the  data  on  tape  for  all 
observations  made  after  1964.  The  most  comprehensive  retrieval  format 
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available  from  STORET  has  been  used.  This  format,  termed  "M0RE=5", 
provides  complete  station  descriptions  along  with  observations  of  up 
to  50  different  water  quality  variables  at  each  station. 

53.  A  total  of  100  variables  have  been  selected  for  inclusion  in 
the  data  base,  as  listed  in  Table  5.  This  necessitated  two  retrievals 
for  each  station.  The  selection  of  variables  is  based  upon  the  objec¬ 
tives  of  the  project  and  upon  the  results  of  the  preliminary  station 
inventories,  which  gave  initial  indications  of  data  availability  as  a 
function  of  parameter  code.  The  list  contains  some  redundancies  due 

to  multiple  ways  of  expressing  various  types  of  measurements  (e.g., 
temperature  as  degrees  F  or  degrees  C  or  phosphorus  as  P  or  PO^) .  Con¬ 
version  routines  have  been  used  bo  eliminate  these  redundancies  in  tape 
processing. 

54.  In  a  final  step,  the  STORET  tapes  have  been  processed  to 
generate  one  file  containing  station  descriptions  (WQ.DESTAT)  and  another 
containing  water  quality  observations  (WQ.OBS) .  This  involved  several 
sort/merge  steps  to  combine  data  from  the  individual  STORET  tapes  in  a 
sequenced  form.  Overall,  STORET  has  provided  1,486,523  observations  at 
3,660  stations. 


INFONET  Data  Acquisition  and  Processing 

55.  The  INFONET17  system  used  by  the  Ohio  River  Division  to  manage 
water  quality  data  has  been  accessed  as  a  second  source  of  information. 
Five  tapes  have  been  obtained  from  ORD,  one  containing  station  descrip¬ 
tions  and  the  other  four  containing  water  quality  observations  for  each 
of  the  four  ORD  districts  (Pittsburgh,  Huntington,  Nashville,  and  Louis¬ 
ville)  .  Because  the  organization  and  formats  of  the  INFONET  tapes  are 
different  from  those  obtained  from  STORET,  a  different  set  of  programs 
has  been  written  and  employed  to  extract  the  data  and  process  it  into  a 
form  suitable  for  merging  with  output  from  the  STORET  tape  processing. 

56.  A  systematic  procedure  has  been  used  in  extracting  data  from 
the  ORD  tapes.  In  the  first  step,  stations  of  interest  have  been  selected 
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from  the  ORD  station  tape  based  upon  ORD  project  identification  codes 
and  used  to  generate  a  station  description  file  keyed  to  members  of  the 
central  project  list.  A  list  of  primary  and  secondary  ORD  station  codes 
has  been  extracted  from  the  station  description  tape.  In  the  final  step, 
the  station  code  and  parameter  code  files  have  been  used  to  extract 
relevant  observations  from  the  ORD  data  tapes.  This  process  has  been 
repeated  for  each  district  and  the  resulting  files  have  been  merged. 

A  total  of  763  stations  and  534,412  observations  have  been  derived  from 
this  data  source. 

Miscellaneous  Data  Acquisition  and  Processing 

57.  Water  quality  data  acquired  from  STORET  and  INFONET  have  been 

supplemented  with  miscellaneous  data  which  has  been  manually  coded  and 

entered  directly  into  the  water  quality  files.  This  has  been  done  to 

improve  the  regional  coverage  of  the  water  quality  data  base.  This 

relatively  time-consuming  approach  has  been  limited  to  two  sources : 

21 

(1)  survey  data  obtained  from  Baltimore  District  for  Almond,  Whitney 

Point,  and  Alvin  R.  Bush  Reservoirs;  and  (2)  survey  data  obtained  from 

22 

the  North  Central  Division  for  Eau  Galle  Reservoir  and  Lac  Qui  Parle. 

58.  Water  quality  data  and  station  descriptions  from  these  sources 
have  been  coded  at  WES.  Keypunching  and  verification  have  been  done  at 
MIT.  The  resulting  files  containing  28  stations  and  2259  observations 
have  been  merged  with  data  from  STORET  and  INFONET  in  the  formats  des¬ 
cribed  below. 


WQ  File  Structures 

59.  The  water  quality  data  base  consists  of  four  files  (WQ.KEY, 
WQ.DESTAT,  WQ.OBS,  WQ. SUM)  and  a  set  of  station  maps  (WQ.maps) .  The 
formats  of  the  four  files  are  given  in  Tables  23  through  26,  respectively. 
The  structure  and  contents  of  each  are  discussed  below. 
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(Continued) 


Table  24  (Concluded) 


60.  WQ.KEY  (Table  23)  contains  station  source  and  location  des¬ 
criptors  and  accounting  information  on  the  amount  of  data  in  the  WQ.OBS 
file,  including  the  number  of  observations,  number  of  sampling  dates, 
date  range,  and  depth  range.  Station  descriptors  have  been  derived  from 
the  WQ.DESTAT  file,  and  accounting  information  from  the  WQ.OBS  file. 

Each  station  has  been  given  a  unique,  8-digit  identifying  code.  The 
first  two  digits  represent  CE  district  (Table  1)  and  the  next  three 
represent  CE  project  (Table  8) .  The  last  three  contain  a  code  which  is 
unique  within  each  project.  The  following  conventions  have  been  used 
in  assigning  the  last  three  digits  of  the  station  code: 

001  -  100  STORET  stations  retrieved  in  March  of  1979 
101  -  200  STORET  stations  retrieved  in  March  of  1980 
301  -  400  EPA  National  Eutrophication  Survey  stations 
501  -  600  INFONET  stations  (Ohio  River  Division) 

801  -  900  stations  entered  manually 


Station  numbers  have  been  assigned  sequentially  within  each  category  and 
project,  after  sorting  the  stations  by  STORET  agency  and  STORET  station 
codes.  The  station  coding  scheme  permits  sorting  and  analysis  by 
station,  district,  project,  station,  and/or  data  source.  The  WQ.KEY 
file  contains  4451  records  (one  per  station) ,  sorted  by  the  8-digit 
station  code. 

61.  WQ.DESTAT  contains  detailed  information  on  station  location 
and  data  source.  As  shown  in  Table  24,  it  contains  four  record  types. 

The  fourth  type  is  repetitive  and  contains  up  to  15  lines  of  detailed 
descriptive  text  on  each  station.  WQ.DESTAT  contains  about  31,000 
records,  sorted  by  the  first  ten  digits  of  each  record  (station  code/ 
record  sequence  number) . 

62.  WQ.OBS  (Table  25)  contains  water  quality  observations.  Each 
record  is  identified  by  station,  date,  time,  depth,  and  parameter  code. 
Standard  STORET  remark  codes  identify  measurements  which  are  less  than  or 
greater  than  indicated  values  or  duplicate  values.  The  last  part  of 
each  record  contains  composite  sample  information.  WQ.OBS,  which  con¬ 
tains  2,023,194  records,  sorted  by  the  first  24  columns  (station/date/ 
time/depth/parameter) ,  is  stored  on  tape  (sequential  access  only) . 


63.  WQ.SUM  (Table  26)  contains  a  water  quality  data  summary  by 
station  and  parameter,  derived  from  analysis  of  the  WQ.OBS  file.  Statis¬ 
tics  include  date  range,  depth  range,  value  range,  mean,  standard  devia¬ 
tion,  and  value  percentiles  (25%,  50%,  and  75%).  Summary  statistics  are 
derived  from  the  first  1000  observations  for  each  station/parameter  combi¬ 
nation.  Each  record  represents  one  station/parameter  combination  and  the 
file  contains  about  75,000  records  sorted  by  station  and  parameter  codes. 

64.  To  provide  direct  access  to  water  quality  station  descriptions 
and  data  summaries,  the  contents  of  the  WQ.DESTAT  and  WQ.SUM  file  have 
been  produced  in  microfiche  form.  Frame  format  is  illustrated  in 

Table  27.  The  heading  of  each  frame  contains  the  district,  project, 
station,  and  station  type  codes  and  names.  Station  descriptions  are 
entered  from  the  WQ.DESTAT  file.  The  data  summary  by  parameter  follows. 
Frames  are  sorted  by  district,  project,  and  station  codes.  A  new  fiche 
card  is  begun  with  each  district.  Card  labels  indicate  the  district  and 
project  described  in  the  first  frame.  The  last  frame  of  each  card 
contains  an  index  which  lists  the  project,  station,  and  associated 
frame  coordinates. 

65.  WQ.maps  is  a  collection  of  station  maps  (one  per  project) 
which  have  been  produced  on  a  Calcomp  line  plotter  using  information  in 
the  WQ.KEY  file.  An  example  is  given  in  Figure  4  which,  can  be  compared 
with  the  watershed  map  in  Figure  3.  Stations  are  located  based  upon 
latitude/longitude  coordinates.  Different  plot  symbols  are  used  to 
identify  station  types.  Only  stations  with  more  than  10  observations 
are  plotted.  Adjustments  in  horizontal  and  vertical  scales  are  made 
for  each  map  so  that  a  linear  distance  scale  is  preserved  and  the  map 
fits  within  an  8.5  x  11  inch  area.  A  scale  factor  in  miles  per  inch  is 
derived  from  the  LISTS. CPL  file  and  plotted  with  a  triangle.  These  maps 
are  useful  for  identifying  station  locations  and  for  refining  the  station 
type  codes.  They  are  subject,  however,  to  errors  in  the  station  coordi¬ 
nates  derived  from  STORET  or  INFONET.  Based  upon  the  maps  and  station 
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descriptions,  some  editing  of  obvious  coding  errors  in  station  coordi¬ 
nates  has  been  possible. 


WQ  Data  Inventories 

66.  Table  28  presents  an  inventory  of  water  quality  data  by  station 
type  and  division.  Corresponding  inventories  by  project  and  district 

are  given  in  Appendix  A.  Overall,  271  out  of  299  projects  are  represented 
in  the  water  quality  files.  Remaining  regional  deficiencies  include 
St.  Paul  District  (6  out  of  13  projects) ,  Portland  District  {9  out  of  17 
projects) ,  and  Los  Angeles  District  (0  out  of  2  projects) . 

67.  An  inventory  by  station  type  and  parameter  code  is  given  in 
Table  29.  As  expected,  temperature ,  pH,  and  oxygen  are  the  most  fre¬ 
quently  represented  parameter  codes  in  the  file.  No  data  were  located 
for  one  code  (09 — average  daily  spillway  flow). 

68.  Phosphorus,  chlorophyll-a,  and  Secchi  depth  data  are  parti¬ 
cularly  relevant  to  assessing  eutrophication  problems  and  therefore  to 
Phase  II  modelling  efforts.  Table  30  presents  an  inventory  of  these 
measurements  made  at  pool  stations  (type  codes  2,  4,  or  5)  by  division. 
Corresponding  inventories  by  project  and  district  are  given  in  Appendix 
A.  Out  of  299  projects  in  the  central  project  list,  total  phosphorus 
data  have  been  located  for  211,  chlorophyll-a  data  for  132,  and  Secchi 
depth  data  for  171.  Some  regional  deficiencies  in  total  phosphorus  data 
are  evident  particularly  in  the  New  England  Division  (12  out  of  22 
projects).  North  Atlantic  Division  (6  out  of  15  projec’ s) ,  North  Central 
Division  (6  out  of  16  projects) ,  and  North  Pacifi~  D  ion  (6  out  of 

27  projects)  .  These  are  also  reflected  in  the  ^.•ropi.  i-a  and  Secchi 
depth  inventories.  These  inventories  indicate  that  the  data  base  will 
not  be  sufficient  to  assess  the  trophic  state  of  all  CE  projects  with 
appreciable  pools.  The  complete  coverage  for  roughly  130  projects 
indicates,  however,  that  the  data  base  should  be  generally  sufficient 
for  model  testing  purposes,  the  primary  objective  of  Phase  II.  Regional 
testing  of  models  should  also  be  possible,  with  a  few  exceptions  (e.g., 
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the  lack  of  chlorophyll-a  or  Secchi  depth  data  for  projects  in  New 
England) . 

69.  The  EPA  National  Eutrophication  Survey  is  a  primary  source 
of  data  for  testing  nutrient  loading  models  and  relationships  among 
within-pool  measures  of  trophic  state.  An  inventory  of  data  holdings 
by  agency,  station  type,  and  parameter  code  is  given  in  Table  31.  Two 
agency  groupings  are  used:  "EPA",  representing  the  National  Eutrophi¬ 
cation  Survey;  and  "OTHER",  representing  all  other  agencies  and  monitoring 
programs.  Station  types  include  tributary  (type  code  1) ,  discharges 
(type  code  3),  and  pool  (type  codes  2,  4,  and  5).  The  parameter  list 
includes  a  variety  of  nutrient,  biological,  chemical,  and  optical 
characteristics  pertinent  to  eutrophication  analysis.  Within  each 
category,  the  numbers  of  observations,  sampling  dates,  stations,  projects, 
and  districts  are  indicated,  along  with  the  total  period  of  record  in 
station-months.  Including  data  from  agencies  other  than  the  EPA/NES  has 
more  than  doubled  the  total  numbers  of  observations  of  most  parameters 
and  provided  additional  useful  measurements  for  some  projects,  such  as 
algal  cell  numbers  and  volumes,  turbidity,  and  suspended  solids.  At 

pool  stations,  non-EPA  agencies  provide  more  chlorophyil-a  observations 
(2036  vs.  1499)  concentrated  in  fewer  projects  (43  vs.  108) .  These  and 
other  statistics  in  Table  31  reflect  the  relative  intensities  of  .i-vency 
monitoring  efforts — many  of  the  non-EPA  programs  are  more  intensive 
and  extensive  temporally  than  the  three-date,  one-growing-season  program 
employed  by  the  EPA/NES. 

70.  Based  upon  data  inventories  in  Appendix  A,  some  projects  for 
which  non-EPA  data  monitoring  of  trophic  state  indicators  has  been 
partic  ly  intense  include  the  following: 

06-372  John  H.  Kerr 

15- 399  Eau  Galle 

16- 311  East  Branch  Clarion  River 

18-093  Monroe 

18- 120  Barren  River 

19- 340  J.  Percy  Priest 

26-359  Sam  Rayburn 

29-111  Pomona 

32-204  Kookanusa 
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Table  31 


Inventory  of  Eutrophication-Related  Water  Quality  Components 
by  Station  Type  and  Monitoring  Agency 


TYPE: 

_TR I  B _ 

_D  S  C  H _ 

PPOl 

AGENCY: 

EPA 

OTHR 

ALL 

EPA 

QTHR 

ALL 

EPA 

OTHR 

ALL 

TOTAL 

P 

NOBS _ 

_ 9794_ 

.23.496, 

_33290_ 

1450 

J_2475_ 

_13925  _ 

_6689_ 

15145  21834 

TOTAL 

P 

NDAT 

977 1  _ 

22370 

32141 

1428 

1  1895 

13323 

1410 

78S9 

9307 

total 

P 

NSTA 

806 

997 

1803 

1  1  6 

416 

532 

479 

737 

1216 

total 

P 

_ N  PRJ _ 

109 

214 

_ 236 

_ ioa_ 

21  1 

_ 237 

1  08 

1  88 

71  1 

total 

P 

NOIS 

27 

30 

31 

27 

29 

30 

27 

28 

29 

total 

P 

mths 

8469 

35292 

43761 

1230 

21050 

22280 

2628 

18388 

21016 

ORTHO-P 

NOBS 

9879 

12198 

22076 

1  456 

5433 

6889 

6668 

1  2851 

19=19 

ortho-p 

NDAT 

9855 

1  0386 

20741 

1  434 

5063 

6497 

14  18 

5931 

7349 

ortho-p 

_N  ST  A 

808 

726 

1  534 

_ 115 

31  1 

427 

479 

=76 

mss 

ortho-p 

N  PR  J 

1  09 

186 

223 

108 

193 

225 

1  oa 

167 

205 

ortho-p 

NOIS 

27 

29 

31 

27 

28 

30 

27 

26 

29 

ortho-p 

MTHS 

8459.15528 .24087 

1218.  8670  9888  2628 

1  1  6  64 

1  4  1  R7 

NH3-N 

NOBS 

1  0002 

14010 

24012 

1  464 

6393 

7857 

6632 

1  2294 

1  8966 

NM3-N 

NDAT 

Q979  .13380  _2  3359 

_1442_ 

=264  .  7706 

_L4  1  8 

=  078 

7396 

NH3-N 

NSTA 

308 

829 

1637 

116 

343 

459 

479 

639 

1118 

NH3-N 

N  PRO 

1  09 

197 

221 

108 

198 

226 

1  08 

169 

202 

NH3-N 

NDIS 

_ 27 

_ 29. 

3Q 

27 

28L 

30 

27 

76 

79 

NH3-N 

MTHS 

8508 

21659 

30167 

1231 

1  1  633 

12864 

2628 

14074 

16702 

TKN 

NOBS 

Q970  11554.  21  524. _ 

1461_ 

=  325 

_ 6186 

_=:75_ 

1  7535...- 

TKN 

NDAT 

9953 

1  1032 

20985 

1439 

5211 

6650 

1  39i 

4764 

6145 

TKN 

NSTA 

304 

777 

1581 

1  16 

276 

392 

470 

=  25 

995 

TKN 

N  PRO 

109 

_  175. 

_ 206 _ 

_ LQg_ 

1  73 

71  1 

_1C5_ 

_ 192 _ 

TKN 

NDIS 

27 

29 

31 

27 

26 

30 

26 

24 

28 

TKN 

MTHS 

8505 

18057 

26562 

1230 

8830 

10060 

2599 

1  0869 

1  3468 

N02*3-N 

NOBS 

9999 

12425 

22424 

1  464 

6000 

7464 

6632 

1  1  385 

1  8067 

N02+3-N 

NDAT 

9975 

1  1907 

21883 

1  442 

5906 

7348 

1418 

53C2 

6720 

N02-3-N  NSTA 

80  7.- 

741 

1546 

1  1  6 

276 

2°2 

4?^ 

C  15_ 

234 _ 

NC2+3-N 

N  PR  J 

1  09 

165 

1  94 

108 

154 

188 

1  08 

1  29 

167 

N02+3-N 

NDIS 

27 

28 

29 

27 

23 

29 

27 

24 

28 

N02+3rfi _ 

_M  JHS _ 

_ 8510 

17657  25177 

1231 

9360  .10591 

7^2° 

1  ?  *  Jfi 

147?4 _ 

N03-N 

NOBS 

5979 

17079 

23057 

784 

10629 

11413 

0 

6553 

6563 

NC3-N 

J3DAI _ 

_ 5268_ 

16133  22101 

_ za_i_ 

QQ87  10768  . 

0 

4,7  7  g _ 

N03-N 

NSTA 

773 

526 

1299 

1 1 1 

274 

385 

0 

409 

409 

N03-N 

N  PRJ 

106 

173 

208 

107 

168 

21  1 

0 

153 

153 

N03-N 

_NDLS _ 

28 

28 

30 

_  25 

26 

29 

0_ 

N03-N 

MTHS 

4904 

20780 

25684 

617 

13582 

14199 

0 

7=88 

7  =  38 

DHL-A 

NOBS 

3 

_ 743_ 

746 

6 

318 

->24 

14  =  9 

CHL-A 

NDAT 

3 

728 

731 

6 

313 

319 

1  490 

1  129 

2619 

CHL-A 

NSTA 

1 

74 

75 

2 

30 

32 

482 

1  47 

629 

CHL-A 

N  PRJ 

1- 

29 

TO 

t 

19 

19 

1  Qfi 

4_^ 

1  32 

chl-a 

NDIS 

1 

14 

15 

! 

12 

1  2 

27 

13 

28 

CHL-A 

MTHS 

5 

1 1  89 

1  194 

8 

377 

385 

2660 

3145 

=  805 

ALGAE <#) 

NOBS 

0 

1058 

1058 

0 

875 

875 

0 

1076 

1076 

ALGAE ( » ) 

NDAT 

0 

1053 

1053 

0 

875 

875 

0 

907 

907 

ALGAE! 

NSTA 

o 

48 

48 

Q 

32 

32 

0 

1  'J'l 

ALGAE(A) 

N  PRJ 

0 

26 

26 

0 

29 

29 

0 

59 

59 

algae(*) 

NDIS 

0 

12 

12 

0 

14 

14 

0 

12 

12 

...  M-THS _ 

Q 

_ Q_ 

WnvvM 

_ 0. 

1437 

1432— 

(Continued) 
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Table  31  (Concluded) 


Tvpg ; 
AGENCY! 

AtGAE(V) 

_NOBS _ 

EPA 

0 

TRI8 

OTHR 

7 

ALL 

7 

EPA 

_ g_ 

DSCH 

OTHR 

0 

ALL 

0 

EPA 

0 

POOL 

OTHR 

_ 344_ 

ALL 

_ 444_ 

A  LoA£ ( y ) 

NOAT 

0 

7 

7 

0 

0 

0 

0 

430 

430 

ALGAE( V) 

NSTA 

0 

1 

1 

0 

0 

0 

0 

40 

40 

AlGAEtV] 

N  PR  J 

0 

1 

1 

0 

0 

0 

0 

14 

14 

A  LGAE  (  V  )~ 

NOIS 

0 

1 

1 

0 

0 

0 

0 

1 

1 

A  LGAE ( V ) 

MTHS 

0 

6 

6 

0 

0 

0 

0 

832 

832 

"" SECCHI 

NOBS 

3 

1 1  81 

1  184 

6 

426 

1487 

5091 

6578 

SECCHt 

NOAT 

3 

1152 

1155 

6 

349 

355 

1478 

4969 

6447 

SECCH I 

NSTA 

1 

172 

173 

2 

35 

37 

401 

406 

977 

SECCHI 

NPRJ 

1 

42 

43 

1 

28 

29 

108 

121 

171 

SECCHI 

NOIS 

1 

14 

14 

1 

12 

12 

27 

20 

28 

SECCHI 

MTHS 

5 

2137 

2142 

8 

597 

605 

2629 

1  2314 

1  4043 

TRANS(X) 

NOBS 

s 

40 

45 

24 

12 

36 

6781 

2556 

9337 

transixl 

NOAT 

2 

23 

25 

6 

9 

15 

1419 

368 

1787 

TRANS(X) 

NSTA 

1 

18 

19 

2 

8 

10 

481 

96 

577 

TRANS(X) 

NPRJ 

1 

1 1 

12 

1 

8 

9 

1  08 

21 

121 

TRANS  (  X  {_ 

_ N  0  I  S _ 

t 

2 

3 

1 

1 

2 

_ 27 

4 

27 

TRANS(X) 

MTHS 

3 

6 

9 

8 

10 

18 

2484 

816 

3300 

lIGHTfX)  N  CBS 

0 

66 

66 

0 

60 

60 

486 

3467 

3953 

LIGHT(X) 

NOAT 

0 

21 

21 

0 

24 

24 

329 

531 

860 

UOHUKI 

NSTA 

0 

16 

16 

0 

12 

12 

203 

201 

404 

LIGhT(X) 

NPRJ 

0 

10 

10 

0 

12 

12 

52 

42 

91 

i-ICHT  (X) 

NOIS' 

0 

1 

1 

0 

3 

3 

12 

7 

18 

LIGHT(X) 

MTHS 

0 

7 

7 

0 

17 

17 

335 

494 

829 

— turbioTt' 

NOBS 

0 

24440 

24440 

0 

14708 

14708 

0 

22258 

22258 

T'JRSIOIT 

NOAT 

0 

23815 

23815 

0 

14555 

14555 

0 

1  0580 

l  0580 

TU90IOIT_ 

NSTA 

0 

_ 90  4_ 

904 

0 

_ 350_ 

350 _ 

0 

762 

_ T52_ 

TUflBIOIT 

NPRJ 

0 

201 

201 

0 

i  99 

1  99 

0 

1  71 

1  7  1 

TUR0IOIT 

NOIS 

0 

30 

30 

0 

28 

28 

0 

25 

25 

TUR8I0IT 

MTHS 

0 

_24945_ 

24045 

0 

1  2907 

1  2907 

0 

19675 

19675 

SUSP  SOL 

NOBS 

0 

12054 

12054 

0 

5766 

5766 

0 

10669 

1  0669 

SUSP  SOL 

NOAT _ 

0 

1  1  470 

1  1  470 

0 

5672 

5672 

0 

4234 

.12  34 

SUSP  SOL 

NSTA 

0 

686 

686 

0 

264 

264 

0 

429 

429 

SUSP  SOL 

NPRJ 

0 

1  68 

168 

0 

169 

169 

0 

1  16 

1  16 

SUSP  SOL  NOIS 

0 

_ 29 

_ 29 _ 

_ Q_ 

26 

26_ 

0 

2X 

21 

' SUSP  SQL 

MTHS 

0 

18422 

1  8422 

0 

8277 

8277 

0 

8401 

8401 

OXYGEN _ NOBS _ 8  4  2216_42224 _ 1  8_292  L6_29234_ 

OXYGEN  N D AT  3  33919  33922  5  25106  26112 

OXYGEN  NSTA  1  1062  1063  2  412  414 

Oxygen _ n  prj _ t _ 2.'  2 _ 212 _ t _ 21 1 _ 217. 

OXYGEN  NOIS  1  30  30  1  29  29 

OXYGEN  M  THS  5  40323  40328  8  22191  22199 


FLO* 

FLO* 

FLO* 

N03S 

NOAT 

NSTA 

7934 
7933 
_ 5.90 

24977 
23692 
_ S33_ 

32911 

31625 

1123 

1447 
1447 
_ 109- 

15753 

15173 

268 

17200 

16620 

376 

0 

0 

0 

1468 

1373 

03 

1468 

1373 

93 

FLOW 

NPRJ 

106 

156 

185 

106 

175 

210 

0 

35 

35 

FLO* 

NDIS 

27 

30 

31 

27 

25 

30 

0 

17 

17 

FLO* 

MTHS 

■-LkLJ 

2B482 

■Sul 

KekEFI 

IIIHP 

0 

IBD 

_£202_! 
1494 
4S2 
_L0  6 
27 

2668 


17465  18959 
1051  1533 


29 

31533 


29 

34201 


Thus,  it  will  not  be  necessary  in  Phase  II  to  rely  exclusively  upon 
data  from  the  EPA/NES  for  assessing  relationships  among  within-pool 
measures  of  trophic  state.  The  stringent  water  quality  and  flow  sampling 
program  requirements  required  for  estimation  of  nutrient  budgets  and 
time  limitations  of  this  project  suggest,  however,  that  EPA/NES  data  be 
used  exclusively  for  evaluating  nutrient  loading  models. 


PART  X:  SED  -  SEDIMENTATION  DATA 


i 


71.  The  seventh  major  file  group  describes  sedimentation  charac¬ 
teristics  of  CE  reservoirs.  It  contains  the  following  elements: 

SED. sheets  -  Sedimentation  Survey  Sheets 
SED. RATES  -  Sedimentation  Rate  File 

This  information  has  been  derived  from  a  collection  of  sediment  survey 
data  for  U.  S.  reservoirs  compiled  in  1975  by  the  Agricultural  Research 
Service  of  the  U.  S.  Department  of  Agriculture^'  A  total  of  84  CE 

projects  in  the  central  project  list  were  included  in  that  compilation. 
These  are  identified  by  the  sedimentation  survey  key  in  the  LISTS. CPL 
file  (Table  8) . 

72.  SED. sheets  consists  of  a  collection  of  the  most  recent  sedi¬ 
mentation  survey  sheets  contained  in  the  appendix  to  the  U.  S.  Department 
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of  Agriculture  (USDA)  compilation  .  These  sheets  contain  detailed  in¬ 
formation  on  project  location,  morphometry,  hydrology,  as  well  as  sedi¬ 
mentation.  An  example  is  given  in  Table  32.  Sheets  have  been  assembled 
in  a  loose-leaf  notebook,  identified,  and  arranged  by  district  and  proj- 
ject  code. 

73.  SED. RATES  is  a  file  containing  the  most  recent  estimates  of 
sedimentation  rates  for  each  of  the  84  projects  located  in  the  USDA  com¬ 
pilation^0.  The  format  of  this  file  is  given  in  Table  33.  Many  of  the 
sedimentation  rate  measurements  antedate  the  water  quality  file.  For  45 
projects,  the  most  recent  survey  data  available  were  taken  during  or 
before  1965.  Rate  estimates  for  some  projects,  however,  will  be  useful 
for  testing  relationships  between  sedimentation  rate  and  nutrient  trapping 
efficiency  during  Phase  II  of  this  study. 


i. 


Table  32 


Sample  Sedimentation  Survey  Sheet 


RESERVOIR  SEDIMENT 
DATA  SUMMARY 

SCS'M  «•  «-M 


CJLHTOM  L6U 


iMMt  Or  KUUtvai* 


u.  i.  svmtu«nt  or  «amcuLTviM 

VJH.  CONURVATION  SOVICt 

46- 13b 
oSfl  5mZ3F  no. 


ooiw  Corgi  of  gnalnatM 


23223321 


l w 


A3SL 


17.  LtNOTM  or  MSCKVOI* 


TTTT 


Xoreh  Canadian 


1  S2S  :  0  Canton  2  M 


i.  xtatm  Qklalio— 


L  COUNTY 


auio4 


14  STONAOC 
ALLOCATION 
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mm 
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3.83 

Rang a  (D 

■ 
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Li 
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i. 
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T 
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mu 

■ 

■ 
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3. 

413.360 

24  OATI  or 
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Mmoo  capacity  los*.  Acnwcr 

14  TOTAL  SCO 
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10,490 
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l. 
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■ 
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i. 

271 
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■ 
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Table  32  (Concluded) 


2ft.  QAt*  Of 
auaviY 


*3.  PERTH  DESIGNATION  RANGE  IN  FEET  9E10W,  ANQ  ABOVE.  CREST  ELEVATION 

43-551 55-45  1 45-40  i40-35  135-30  1 30-  25  1 25-  20  1 20-  15  113-8  la-cr  ler  -3  H-3--HC 

PERCENT  OF  TOTAL  SEDIMENT  LOCATED  NlTMlN  OEPTH  DESIGNATION 


May  1953 

4.3 

12.1 

11.9 

15.7 

17.1 

15.2 

12.0 

7.4  | 

4.3 

0.0 

0.0 

0.0 

Oct  1959 

3.2 

7.7 

5.6 

12.0 

16.4 

22.2 

14.7 

9.9 

6.7 

1.6 

0.0 

0.0 

Sepc.1966 

3.5 

13.2 

9.1 

13.1 

16.3 

15.4 

11.5 

7.4 

6.2 

2.3 

1.4 

0.1 

21  OAft  Of 

SO  Ry«V 
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PERCENT  Of 

~otal  ORIGINAL  LE 

NGTH  OF  RESERVO .«  | 

0-10 
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PERCENT  Of  TOTAL  SEGMENT  LOCATED  niTHin  REACH  DESIGNATION  I 

May 

1953 

2.5 

6.4 

10.7 

„ 

21.: 

17.9 

— 

15.6 

4.0 

2.3  0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

Oct 

1959 

2.3 

2.2 

4.  7t 

17. C 

27 . 

19.1 

17.1 

6.1 

3.1  I  1.2 

0.0 

0.0 

0.0 

0.0 

0.0 
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4.3 

5.7 
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-■ 
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16.2 

14.1 

5.3 

3.3  1.3 

1.3 
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RANGE  in  RESERVOIR  OPERATION 


NATCH  'CAR 

44*  £LCV 

WIN  *L*V 

INFLOW.  AC  FT 

aatij*  y  car 

MA*.  EL£V 

WIN  clcv. 

NFLOW.  AC.  FT 

1947  (2  on 

• 

- 

248 
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1625.51 

1596.32 

422.930 

1943 

1599.30 

- 

90,800 

1958 

1516.12 

1613.68 

185.310 

1949 

1614.27 

1595-91 

437.640 

1959 

1615.40 

1613.12 

104, 130 

1950 

1623.34 

1602.35 

532, 180 

I960 

1625.56 

1613. 79 

145  .  230 

1951 

1628.05 

1602.36 

540.420 

1961 

1614.65 

1613.-9 

135.330 

1952 

1605.30 

1398.65 

31.170 

1962 

1615.16 

1613.12 

■06.920 

1953 

1598.75 

1588.66 

29,530 

1963 

1614.21 

1610.21 

61.326 

1954 

1596.75 

1588.30 

52.346 

1964 

1611.93 

1600.36 

20.513 

1955 

1615.50 

1585.66 

223 .  '90 

1965 

1616.18 

1600.41 

159,300 

1956 

1613.27 

1604 . 33 

10.953 

1966 

1615.52 
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7,335 
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1 

1 
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3,987 

1615 

7,379 

114,400 

i 

1595 

2,678 

14.  160 

1620 

9,497 

157,800 

1 

I 

1596.5 

3,018 

18.430 

1625 

11,238 

209,300 

1600 

3,652 

29.950 
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12.756 

269,300 

i 

1 

1603 

^  ,  ^11 
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1635 

14,510 

338,100 

1 

1 

1605 

4.873 

51,390 

1638 

15,700 
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! 

1 

_ 1 _ 

*7  REMARKS  AND  REFERENCES 
y  top  oc  spillway  ;acti  clottd. 

2/ 

1/ 


Spillway  erne  is  at  alavatlon  1613.3. 

Data  of  diversion. 

Reservoir  is  being  operated  ac  "designated"  pool  alavaclon  1615.3  which 
temporarily  provides  auaiclpal  wacar  supply  foe  Oklahoma  Clcy. 

Area  aa  dace rain ad  by  AUK  eosadetse  for  April  1954  drainage  area  data  publication. 

Excludes  4,642  sq.  ai.  of  watershed  not  contributing  to  sadiaanc;  1735  sq.  al. 

above  Tort  Supply  Dam  and  25  sq.  ad.  surface  area  of  Canton  Lake. 

To  provide  a  uniform  presentation  of  data  from  all  sedimentation  resurveys  of 
Canton  Lake,  data  summaries  for  the  1953  and  1959  rasurveys  have  been  revised 
to  conform  with  present  instructions. 

Includes  above  crest  deposits. 

«e  agency  waking  soever  U.  S.  Army  Engineer  District.  Tulsa 

«.  aOINCf  supplying  0*T*  U.  S.  Army  Engineer  District.  Tulsa  «  aatf  October  1970 

TS 


6/ 


7/ 


»Mie«mnui  sA  see 


i&iiuiA  in 
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PART  XI:  NES  -  EPA  NATIONAL  EUTROPHICATION  SURVEY  DATA 


74.  The  eighth  file  group  consists  of  data  and  reports  obtained 

4  9 

from  the  EPA  National  Eutrophication  Survey  '  ,  exclusive  of  the  water 
quality,  hydrologic,  and  drainage  area  data  described  in  previous  sec¬ 
tions.  It  consists  of  the  following  elements: 

NES. reports  -  EPA/NES  Working  Papers 

NES. COMP  -  Compendium  of  Lake  and  Reservoir  Data 

NES. SUM  -  Compendium  Summary 

This  file  group  contains  information  on  108  CE  projects  as  well  as  the 
other  704  lakes  and  reservoirs  which  were  sampled  by  the  EPA/NES  through¬ 
out  the  U.  S.  Besides  providing  detailed  information  on  point-source 
nutrient  loadings  to  CE  projects  needed  for  nutrient  budget  estimation 
in  Phase  II,  it  provides  a  basis  for  comparisons  of  lakes  and  reservoirs 
with  respect  to  morphometric,  hydrologic,  and  nutrient  loading  response 
characteristics.  Such  comparisons,  in  turn,  can  provide  means  of  inter¬ 
preting  the  performance  of  lake  nutrient  loading  models  in  reservoirs. 

75.  During  1972-75,  the  EPA  conducted  a  systematic  survey  of  812 
lakes  and  reservoirs  in  the  U.  S.  The  original  objective  of  the  National 
Eutrophication  Survey  was  to  develop  a  data  base  for  assessing  the 
impacts  of  point-source  nutrient  discharges  on  the  trophic  conditions 

9 

of  lakes  and  reservoirs.  The  NES  produced  a  series  of  working  papers 

summarizing  and  interpreting  the  results  for  each  impoundment  or  lake 

monitored.  Results  have  also  been  summarized  in  a  compendium  format  and 

23 

published  in  four  volumes 

76.  NES. reports  consists  of  the  collection  of  working  papers  des¬ 
cribing  NES  surveys  and  results  in  each  of  108  CE  projects.  Each  report 
has  been  referenced  to  the  data  base  by  district  and  reservoir  code. 

Maps  extracted  from  the  reports  have  been  included  in  the  WATS. maps  file. 

77.  NES. COMP  is  a  computer  file  containing  a  nationwide  summary 
of  NES  data  for  775  lakes  and  reservoirs,  obtained  from  the  Corvallis 

4 

laboratory  of  the  EPA  .  A  sample  printout  of  data  from  the  file  is 
given  in  Table  34.  Similar  printouts  have  been  produced  for  all  CE 
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projects  in  the  compendium  file  and  arranged  in  a  notebook  by  district 
and  project  code.  The  compendium  contains  data  for  106  out  of  108  CE 
projects  sampled  by  the  EPA/NES. 

78.  NES.SUM  is  a  summary  of  the  NES.COMP  file,  with  a  few  modifi¬ 
cations.  The  format  of  the  summary  file  (Table  35)  is  more  convenient 
for  input  to  statistical  analysis  programs,  since  one  record  is  used 
for  each  lake  or  reservoir.  A  number  of  missing  latitude  and  longitude 

coordinates  have  been  added,  based  upon  estimates  derived  from  USGS 

£ 

Hydrologic  Unit  Maps  .  Each  lake  or  reservoir  has  also  been  referenced 
by  CE  division  to  provide  a  basis  for  regional  contrasts  of  lake  and 
reservoir  characteristics. 

79.  A  few  simple  analyses  have  been  done  to  provide  descriptions 
of  types  and  amounts  of  data  contained  in  the  NES.SUM  file.  The  regional 
distribution  of  lakes  and  reservoirs  described  in  the  file  is  depicted 

in  Figure  5.  A  breakdown  of  impoundment  numbers  as  a  function  of  trophic 
state,  impoundment  type,  and  CE  division  is  given  in  Table  36.  The 
numbers  of  CE  projects  are  listed  as  a  function  of  division  and  trophic 
state  in  Table  37. 

80.  The  results  of  preliminary  lake/reservoir  comparisons  which 
have  been  made  using  data  from  the  NES  compendium  tape  are  summarized 
in  Table  38.  Three  groups  of  impoundments  have  been  compared:  natural 
lakes  (N=310) ;  non-CE  reservoirs  (N=359) ;  and  CE  reservoirs  (N=106) . 

The  stratification  of  thirty-six  original  and  conposite  variables  across 

these  groups  has  been  studied  using  the  BMDP-77  conputer  program1. 

Within-group  means  and  Analysis  of  Variance  (ANOVA)  statistics  are 

listed  in  Table  38.  Detailed  results  with  histograms  are  presented  in 
.24 

a  previous  report  . 

81.  While  the  scope  of  this  report  precludes  detailed  interpreta¬ 
tions  of  the  data,  a  few  comments  on  these  results  seem  appropriate: 

a.  At  the  95%  significance  level,  differences  across  groups 
are  evident  in  all  cases  except  longitude,  conductivity, 
phosphorus  retention  coefficient,  nitrogen  retention 
coefficient,  and  the  second  Vollenweider  phosphorus 
loading  statistic25. 


/ 
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b.  Strongest  differences  across  groups  are  apparent  for 
drainage  area  (F=77.5) ,  drainage  area/surface  area 
(F=73.6),  outflow  rate  (F=60.7),  and  drainage  area/ 

*  maximum  depth  (F=53.4) . 

c.  Reservoirs  have  higher  potential  phosphorus  concentra¬ 
tions,  based  upon  phosphorus  loading  and  the  Vollenweider 
statistics ,  but  lower  observed  phosphorus  and  chlorophyll-a 
concentrations  than  natural  lakes.  This  is  an  indication 
that  the  Vollenweider  statistics  may  give  biased  predic¬ 
tions  in  reservoirs  (i.e.,  over-predict  loading  impact) . 

d.  Reservoirs  have  less  transparency,  despite  less  chloro¬ 
phyll-a.  This  is  probably  an  effect  of  mineral  turbidity 
in  reservoirs. 

e.  N/P  loading  ratios  are  somewhat  lower  for  reservoirs  than 
for  natural  lakes;  the  reverse  is  true,  however,  for 
inorganic  N/dissolved  P  ratios  measured  during  the  summer 
within  the  impoundments. 

_f.  The  composite  statistic  (Secchi  depth/mean  depth)  is 
strongly  stratified  across  groups  (F=44.8).  Assuming 
that  the  depth  of  the  photic  zone  is  roughly  twice  the 
Secchi  depth,  the  means  in  Table  38  indicate  that  typi¬ 
cally  62%  of  the  volume  of  lakes  is  available  for 
photosynthesis,  compared  with  30%  in  the  case  of  CE 
reservoirs.  Reservoirs  probably  have  greater  shoreline 
development,  however,  and  may  be  more  influenced  by  photo¬ 
synthetic  processes  occurring  in  littoral  zones. 

g.  While  a  4-degree  difference  is  evident  in  the  average 
latitude  of  CE  reservoirs  as  compared  with  lakes,  a 
detailed  look  at  Figure  5  reveals  that  the  distribution 
of  NES  lakes  is  bimodal,  with  clusters  in  the  North 
(primarily  Minnesota,  Wisconsin,  Michigan,  and  Maine) 
and  in  the  extreme  South  (Florida) .  The  maximum  reser¬ 
voir  density  occurs  between  these  extremes. 

Because  of  the  strong  regionality  of  the  lake  vs.  reservoir  distributions 
evident  in  Figure  5,  it  is  difficult  to  separate  the  effects  of  impound¬ 
ment  type  from  those  of  regior  with  analyses  of  the  type  described  above. 
Thus,  in  future,  more  complete  analyses,  it  will  be  important  to  control 
for  regional  differences  by  comparing  subsets  of  lakes  and  reservoirs 
within  defined  regions. 
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OTHER  =  combination  of  two  or  more  trophic  states 
GRAND  TOTAL  =  number  of  impoundments  in  LISTS. CPL  file 


Table  38 


Summary  of  Lake/Reservoir  Comparisons  Derived  from  EPA/NES  Compendium 

WITHIN -GROUP  MEANS  AN OVA  RESULTS 

(N-309)  (N-360)  (N-106) 


* 

c 

~ 

Variable 

Nat. 

Lakes 

Non-CE 

Reserv. 

CE 

Reserv. 

F 

Prob 

(greater  F) 

Latitude  (degrees  N) 

41.6 

39.0 

37.6 

34.8 

<  .0001 

A 

Longitude  (degrees  W) 

91.7 

93.6 

92.3 

1.9 

.16 

G 

Drainage  Area  dan2) 

222. 

1358. 

3228. 

77.5 

< .0001 

G 

Surface  Area  Oca2) 

5.6 

3.6 

34.5 

44.4 

N 

G 

Volume  (km2m) 

27.3 

50.8 

239. 

36.4 

I* 

G 

Mean  Depth  (a) 

4.5 

5.7 

6.9 

10.4 

M 

G 

Maximum  Depth  (m) 

10.7 

15.8 

19.8 

18.8 

ft 

G 

Hydraulic  Res.  Time  (yr) 

.74 

.23 

.37 

23.5 

ft 

G 

Overflow  Rate  (m/yr) 

6.5 

25. 

19. 

40.9 

ft 

G 

Outflow  Rate  (km2m/yr) 

47.9 

236. 

650. 

60.7 

•• 

G 

Dr.  Area/  Surf.  Area 

33. 

156. 

93. 

73.6 

•• 

G 

Dr.  Area/Volume (1/m) 

6.6 

26. 

13. 

39.5 

ft 

G 

Maximum  Depth/Mean  Depth 

2 . 5 

2.8 

2.9 

7.3 

.0008 

G 

Relative  Depth 

.40 

.47 

.30 

8.6 

.0002 

G 

Surf  A/  Max  Depth 

.53 

.56 

1.7 

24.6 

<  .0001 

G 

Dr.  Area/Max  Depth 

20. 

86. 

162. 

53.4 

A 

pH 

8.06 

7.74 

7.63 

24.1 

< . 0001 

G 

Alkalinity  (mg/1) 

87. 

63. 

6$. 

8.6 

.0002 

G 

Conductivity  (UMhos/cm) 

253. 

214. 

208. 

2.4 

.0950 

G 

Total  Phosphorus  (mg/1) 

.054 

.053 

.039 

3.6 

.0268 

G 

Dissolved  P  (mg/1) 

.021 

.018 

.011 

9.5 

< . 0001 

G 

Inorganic  Nitrogen  (mg/1) 

.20 

.26 

.30 

9.7 

< .0001 

G 

Secchi  Depth  (m) 

1.4 

1.2 

1.1 

7.7 

.0005 

G 

Chlorophyll-a  (mg/1) 

14. 

10. 

8.9 

11.4 

< .0001 

G 

Algal  Assay  Yld  (mg/1) 

2.5 

2.2 

1.3 

4.0 

.0194 

G 

2 

P  Loading  (g/»  -yr) 

.87 

2.9 

1.7 

29.1 

< .0001 

A 

P  Retention  Coef. 

.36 

.35 

.40 

0.7 

.51 

A 

NPS  Load  Fraction  -  P 

.72 

.80 

.81 

8.4 

.0002 

G 

N  Loading  (g/m2-yr) 

IS. 

45. 

28. 

22.6 

< .0001 

A 

N  Retention  Coef. 

.24 

.20 

.17 

2.5 

.081 

A 

NPS  load  Fraction  -  N 

.87 

.93 

.96 

18.7 

<.0001 

G 

N  Load/P  Load 

20. 

16. 

16. 

8.0 

.0004 

G 

Inorg  N/  Diss.  P 

9.1 

14. 

26. 

29.4 

< . 0001 

G 

♦Vollenweider  Stat.  1 

.31 

.56 

.42 

15.2 

It 

G 

+Vollenweider  Stat.  2 

056 

.067 

.057 

1.9 

.1586 

G 

Secchi  D/  Mean  Depth 

.31 

.20 

.15 

44.3 

< .0001 

*  C  =  Code  for  Type  of  Mean  (A  -  Arithmetic,  G  -  Geometric) 

+  Statistics  for  Assessing  Phosphorus  Loading  Impacts  on  Lake  Eutrophication; 
Stat.1  -  Lp/es'5-(ref.25) ;  Stat.2  -  (Lp/Qa)/{1  +  /r)  -  (ref. 26) 


PART  XII:  NUMERICAL  CHARACTERIZATION  OF  RESERVOIR  HYPSOGRAPHIC  CURVES 


Introduction 


82.  Estimates  of  reservoir  volume  and  area  variations  with  eleva¬ 
tion  are  required  for  computing  volume-averaged  concentrations  and  pool 
hydraulic  residence  times.  This  information  is  generally  available  in 
the  form  of  a  table  which  lists  area  at  and  volume  below  specific  eleva¬ 
tions.  Morphometry  tables  have  been  compiled  for  most  of  the  303  CE 
projects  in  the  data  base.  This  paper  describes  numerical  investigations 
of  these  tables,  specifically  covering  the  relative  accuracies  of  various 
curve-fitting  and  interpolation  techniques.  Results  will  be  used  to 
design  methods  for  summarizing,  storing,  and  applying  morphometric 
information  in  other  phases  of  the  project.  Results  are  also  relevant 

to  other  aspects  of  reservoir  management,  specifically  including  the 
estimation  of  reservoir  volumes  and  sedimentation  rates  from  range  survey 
or  contour  area  data. 

83.  This  study  has  been  performed  prior  to  completion  of  the 
project  morphometry  file.  A  subset  of  data  has  been  selected  which 
covers  147  projects,  each  with  at  least  five  listed  elevations  and  with¬ 
out  obvious  errors  (such  as  decreasing  areas  or  volumes  with  increasing 
elevation) .  A  total  of  1285  elevation/are a/ volume  data  points  have  been 
compiled  for  these  147  projects  from  various  sources.  Results  of  this 

study  are  summarized  below.  Details  can  be  found  in  a  working  paper 
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submitted  as  a  progress  report  under  this  contract 

Approach 

84.  The  objective  is  to  design  and  evaluate  methods  for  estimating 
area  and  volume  at  any  elevation,  giver  information  available  for  specific 
elevations.  Methods  can  generally  be  classified  as  curve-fitting  or 
interpolating.  Accuracies  of  these  methods  can  be  assessed  by  comparing 
observations  and  predictions  of  the  following: 


a.  V(V)  =  volume  estimated  from  volume/elevation  points 

b.  A(V)  =  area  estimated  from  volume/elevation  points 

c_.  A  (A)  =  area  estimated  from  area/elevation  points 

d.  AV(A)  =  incremental  volume  estimated  from  area/elevation 
points 

Statistics  a  and  c  are  simple  curve-fitting  or  interpolating  problems. 
Statistic  b  involves  differentiation  of  the  volume  curve  to  estimate  area 


~  3V 

A(V)  =  ||  (1) 

where 

Z  is  the  total  depth,  ft 

Statistic  d  involves  integration  of  the  area  curve  between  specific 
depth  limits  (Z^  and  Z^)  to  estimate  incremental  volume: 


AV  (A) 


(2) 
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85.  In  testing  each  method,  a  jackknife  procedure  has  been  used 
to  estimate  the  above  statistics  for  internal  elevation  points  (i.e., 
tests  are  not  made  for  the  top  and  bottom  listed  elevations  in  each 
project,  although  these  elevations  are  used  in  the  fitting  or  inter¬ 
polating  process) .  In  applying  the  jackknife,  information  from  the 
point  being  tested  is  not  used  in  estimating  the  parameters  of  the  pre¬ 
dictive  function.  For  instance,  estimates  of  area  or  volume  at  the 
second  listed  elevation  are  derived  exclusively  from  information  in  the 
first  and  third  through  last  listed  elevations. 

86.  Reported  values  and  predictions  are  compared  using  the  follow¬ 
ing  error  statistic: 


D  = 


(3) 


where 

Y  =  reported  value  (V,  A,  or  AV) 

Y  =  estimated  value 
D  =  error  statistic 
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For  errors  of  the  magnitude  studied  here,  the  value  of  D  approaches  the 
difference  between  the  reported  and  predicted  values,  expressed  as  a 
fraction  of  their  average.  In  most  cases,  the  D  statistic  seems  to  have 
reasonably  symmetric  error  distributions.  For  each  of  the  four  types 
of  area/volume  comparisons,  estimates  of  bias  and  standard  error  in  D 
have  been  derived  for  a  variety  of  predictive  methods,  as  described  below. 


Curve-Fitting  Schemes 


87.  A  single- term  power  function  has  some  useful  properties  for 
fitting  these  types  of  curves  and  is  the  simplest  of  the  methods  evalu¬ 
ated: 


V  = 


A  = 


(4) 

(5) 


Z  =  E-  E 

o 


(6) 


where 

c  ,  c  ,  b  b  *  enpirical  parameters 
v  a  v  a 

Z  =  total  depth  (ft) 

E  -  elevation  (ft,  msl) 

E  =  elevation  at  V=A=0  (ft,  msl) 
o 

Model  parameters  can  be  estimated  using  linear  regression  by  transforming 
V,  A,  and  z  values  to  logarithms.  The  following  equalities  should  hold 
if  the  model  is  valid  for  a  particular  reservoir: 


3V  .  _bv "  1  „ 

—  =  bvCvZ  =  A 

.*.c  =  b  c 

a  v  v 


b  =  b  -  1 
a  v 


(7) 

(8) 
(9) 


Equations  (8)  and  (9)  can  be  used  to  test  the  consistency  of  the  volume 
and  area  curves.  The  slopes  of  volume  and  area  vs.  total  depth  on  a 
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log/log  plot  should  differ  by  1.0.  Dividing  equation  (4)  by  equation  (5) 
yields  the  fol’owing: 


(10) 


If  the  basic  model  holds,  the  ratio  of  volume  to  area  (or  mean  depth) 
should  be  proportional  to  the  total  depth.  Thus,  the  model  can  be  tested 
through  regression  analysis  of  the  following  equation: 


(11) 


A  t-test  can  be  applied  to  the  mean  and  standard  error  b within  each 
project  to  determine  whether  the  estimate  is  significantly  different 
from  1.0. 

88.  In  testing  the  power  function  model,  the  parameters  in  equa¬ 
tions  (4) ,  (5) ,  and  (11)  have  been  estimated  for  each  of  147  reservoirs. 
The  statistical  distributions  of  the  optimal  slope  parameters  (b  ,  b  , 
bz)  are  shown  in  Figure  6  on  log-normal  probability  scales.  Median 
values  are  2.97,  1.97,  and  0.97,  respectively.  Thus,  in  the  "typical" 
reservoir,  volume  and  area  increase  approximately  as  the  cube  and  square 
of  total  depth,  respectively.  This  corresponds  to  the  solid  geometry 

of  an  inverted  cone  or  pyramid.  About  10%  of  the  projects  have  bv 
values  exceeding  4  and  4%  have  by  values  less  than  2.  The  parameters 

b  and  b  contain  roughly  the  same  information  as  the  statistically- 

^  28 

defined  "lake  form"  proposed  by  Hakanson 

89.  Based  upon  t- tests,  the  bz  parameter  differs  significantly 
(p=.05)  from  1.0  in  about  35%  of  the  projects.  This  means  that  the 
single-term  power  function  model  fits  about  two  thirds  of  the  projects. 
In  the  remaining  one  third,  the  slope  parameters  bv  and  ba  are  Inconsis¬ 
tent  and/or  variable  with  depth. 

90.  Table  39  lists  error  statistics  for  this  model.  Estimates  of 
the  first  (V(V))  and  third  (A(A) )  statistics  are  derived  directly  from 
equations  (4)  and  (5) ,  respectively.  Areas  are  estimated  from  volume 


99 


1 


1 

1 

Table  39 

Evaluation  of  the  Power  Function  Model 


Gross* 

Median** 

Statistic 

Standard  Error 

Standard 

V(V) 

.097 

.062 

A(V) 

.115 

.081 

A(A) 

.097 

.062 

AV  (A) 


177 


081 


curves  according  to  equation  (7).  Changes  in  volume  within  various 
strata  are  estimated  from  area  curves  according  to: 


AV  (A) 


A222-A1Z1 

ba+1 


(12) 


(13) 


Standard  errors  of  D  range  from  .097  (V(V))  to  .177  (AV(A)).  The  higher 
standard  error  of  AV(A)  could  be  related  to  the  fact  that  it  is  an  incre¬ 
mental  value  which  is  more  subject  to  measurement  error  on  a  percentage 
basis  than  is  total  volume  or  area.  A  D  standard  error  of  .097  corres¬ 
ponds  roughly  to  a  standard  error  of  10%,  or  to  95%  confidence  limits 
of  ±  20%  in  an  estimated  area  or  volume.  The  differences  between  the 
gross  standard  error  and  the  median,  within-pro ject  standard  error 
reflects  the  skewness  of  the  error  distribution  across  projects,  i.e., 
the  model  seems  to  fit  some  projects  considerably  better  than  others, 
as  indicated  by  the  bz  distribution. 

91.  In  order  to  improve  upon  the  above  model,  variations  in  the 
slope  parameters  b^  and  b&  with  depth  must  be  accounted  for.  The  sim¬ 
plest  way  of  doing  this  is  to  include  higher-order  terms  in  the  regres¬ 
sion  equations: 


m 

V*  =  £  c.Z*1  (14) 

i=0  1 

m 

A*  =  £  d.Z*i  (15) 

i=0  1 


where 

c^,  d^  =  empirical  coefficients  (i=0,  m) 

*  =  superscript  denoting  log^Q  transformation 
m  =  maximum  degree  of  polynomial 


The  error  distributions  of  these  functions  have  been  evaluated  for 
maximum  degrees  ranging  from  1  to  5.  For  m=l,  the  scheme  is  equivalent 


to  the  single-term  power  function  model  discussed  above.  For  comparative 
purposes,  linear  polynomial  functions  have  also  been  tested: 

m 

V  =  E  c-zl  (16) 

i=0  1 

m 

A  -  £  d.Z1  (17) 

i=0  1 

For  each  reservoir,  the  maximum  degree  of  the  polynomial  has  been  limited 
to  the  minimum  of  m  and  the  number  of  elevations  in  the  table  minus  two. 

92.  The  error  statistics  in  Table  40  indicate  that  logarithmic 
polynomials  are  preferable  to  linear  ones  according  to  most  criteria. 

Log  transformation  tends  to  linearize  the  relationships  and  renders  them 
easier  to  fit  with  low-order  polynomial  terms.  Some  reduction  in  error 
is  achieved  by  including  quadratic  and  cubic  terms  in  depth.  Addition 
of  fourth-  and  fifth-order  terms,  however,  tends  to  increase  estimation 
error,  presumably  because  higher-order  polynomials  can  have  local  minima 
and/or  maxima  between  the  fitted  points.  These  are  artifacts  of  the 
fitting  process  which  can  cause  large  estimation  errors  between  the 
fitted  points.  Cubic  polynomials  apparently  have  about  the  best  combina¬ 
tion  of  flexibility  and  smoothness  for  these  purposes  and  data  densities. 

Interpolation  Schemes 

93.  Interpolation  methods  can  be  used  as  alternatives  to  the 
curve-fitting  techniques  described  above.  Interpolation  essentially 
involves  fitting  different  curves  to  different  sections  of  each  volume/ 

area/elevation  table.  A  variety  of  interpolation  methods  have  been 

26 

tested  on  the  same  data  set  used  above  .  These  involve  different  trans¬ 
formations  of  the  volume  and  area  points,  including  inverse  power  func¬ 
tions  (first  through  fifth  order)  and  logarithmic.  Of  the  methods 

tested,  one  involving  logarithmic  transformations  of  volume,  area,  and 

2  6 

total  depth  points  has  been  shown  to  have  the  lowest  error  statistics 
The  errors,  however,  are  essentially  equivalent  to  the  error 
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Table  40 


Evaluation  of  Polynomial  Functions 


Tr ans  f o  rmation 

Maximum 

Degree 

Gross  Standard  Error 

V(V)  A (V)  A (A) 

AV(A) 

logarithmic 

1 

.097 

.115 

.101 

.182 

2 

.069 

.083 

.081 

.166 

3 

.062 

.067 

.081 

.164 

4 

.216 

.292 

.129 

.173 

5 

1.198 

1.280 

.534 

.491 

linear 

1 

.371 

.615 

.200 

.281 

2 

.249 

.170 

.134 

.212 

3 

.134 

.088 

.106 

.196 

4 

.150 

.085 

•  111 

.193 

5 

.221 

.136 

.147 

.265 
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characteristics  of  the  best  curve-fitting  scheme  (log/log  cubic  poly¬ 
nomials)  .  Results  suggest  that  either  of  these  methods  probably 
approaches  the  limits  of  data  accuracy. 

Conclusions 


94.  In  one  sense,  numerical  interpolation  methods  are  preferable 
to  fitted  curves  because  the  former  are  more  flexible  and  do  not  rely 
strongly  upon  particular  forms  or  curve  shapes.  Interpolation  requires 
storage  of  and  access  to  the  entire  table,  however,  as  opposed  to  a  few 
parameters  in  the  case  of  a  fitted  function.  Fitted  curves  also  provide 
some  smoothing  of  the  entries  in  the  table  which  may  serve  to  filter 
errors . 

95.  Based  upon  relative  errors,  storage  requirements,  and  compu¬ 
tational  considerations,  log/log  cubic  polynomials  seem  to  be  the  best 
alternative  for  summarizing  hypsographic  curves,  given  data  of  the 
type  examined  here.  The  approximate  equivalence  of  the  V(V)  and  A(V) 
error  statistics  indicates  that  storage  of  the  parameters  of  the  volume 
curve  alone  would  be  adequate  as  a  basis  for  estimating  both  area  and 
volume.  Algebraic  differentiation  of  the  volume  polynomial  can  be  used 
to  estimate  area  at  any  given  depth.  Thus,  both  the  area  and  volume 
curves  can  be  summarized  by  a  total  of  four  polynomial  parameters 
according  to  the  following  scheme : 


Z*  =  log.  (E  -  E  )  (18) 

10  o 

V*  =  log1Q(V)  (19) 

A*  =  log^Q(A)  (20) 

V*  =  c  +  c, Z* +  c_Z*2 + c,Z*3  (21) 

o  1  2  3 

A  =  (c  +  2c  Z*  +  3c  Z*2) V/ (E  -  E  )  (22) 

12  3  o 
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For  cases  in  which  data  from  only  a  few  elevations  are  available  and/or 
the  quadratic  or  cubic  terms  do  not  add  appreciably  to  accuracy  (as 
assessed  via  stepwise  polynomial  regression) ,  first-  or  second-order 
polynomials  may  be  used. 

96.  Use  of  the  method  requires  knowledge  of  the  base  elevation, 

Eq.  If  not  available,  it  can  be  estimated  approximately  by  extrapolating 

a  plot  of  vs.  E  to  the  horizontal  axis.  Any  error  in  E  estimated 

o 

in  this  way  would  be  offset  in  subsequent  estimation  of  the  polynomial 
coefficients . 

97.  The  same  functions  can  be  used  in  very  data-limited  situations 
in  which  only  estimates  of  mean  depth,  maximum  depth,  and  surface  area 
are  available.  Approximate  parameter  estimates  in  this  case  are  given 
by: 


c 

o 


c3  =  0 


z  /z 

max'  mean 


log  (AZ  )  -  c  log  (Z  ) 
10  mean  X  10  max 


(23) 

(24) 

(25) 


98.  To  insure  that  the  fitted  polynomial  is  not  used  outside  of 
the  range  of  data  availability,  maximum  and  minimum  elevations  should 
also  be  stored  with  the  fitted  coefficients.  It  will  be  necessary  to 
test  the  fitted  coefficients  to  insure  that  the  areas  estimated  through 
differentiation  (equation  (22))  are  positive  and  increasing  with  eleva¬ 
tion  throughout  the  application  range. 
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SLOPE 


Figure  6 

Distributions  of  Volume  and  Area  Slope  Parameters 


PERCENTILE 
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PART  XIII:  VARIABILITY  OF  TROPHIC  STATE  INDICATORS  IN  RESERVOIRS 


Introduction 

99.  The  development  and  testing  of  empirical  eutrophication  models 
for  reservoirs  requires  averaging  of  water  quality  measurements  over 
spatial  and  temporal  scales.  If  within-pool  water  quality  variations 
are  not  random  with  respect  to  date,  station,  or  depth,  then  summary 
statistics  for  a  given  reservoir  will  depend  to  some  extent  upon  the 
particular  da ta-r eduction  method  employed.  The  choice  of  reduction 
method  may,  in  turn,  influence  conclusions  regarding  the  adequacy  of 
existing  models  as  well  as  the  parameter  estimates  of  any  new  models 
which  may  be  developed. 

100.  There  is  no  standard  data  reduction  procedure  which  can  be 

used  prior  to  model  development,  testing,  or  application.  Methods  have 

included,  for  example:  (1)  taking  the  median  or  mean  of  all  within-pool 
23 

observations  ;  (2)  sequential  averaging  over  depths,  stations,  and 

29  30 

dates  :  (3)  seasonal  averaging  within  specific  depth  ranges  ;  and 

(4)  various  weighted-averaging  schemes  which  reflect  morphometric  char¬ 
acteristics.  As  compared  with  natural  lakes,  many  reservoirs  pose 
special  data  reduction  problems  due  to  extreme  spatial  and/or  temporal 
variations  in  conditions. 

101.  In  this  section,  a  subset  of  the  current  CE  water  quality 
data  base  is  analyzed  in  order  to  describe  spatial  and  temporal  varia¬ 
tions  in  trophic  state  indicators  within  a  group  of  reservoirs.  The 
analysis  covers  spatial  relationships,  seasonal  relationships,  variance 
components,  and  error  estimation.  Implications  of  the  results  are  dis¬ 
cussed  with  respect  to  the  design  of  monitoring  programs  and  use  of  the 
data  in  model  development  and  evaluation.  Details  on  many  of  the 

31  32 

procedures  used  and  results  discussed  below  can  be  found  elsewhere  ' 


107 


Data  Base 


9 

102.  EPA  National  Eutrophication  Survey  (NES)  data  from  484 
stations  located  within  108  CE  projects  have  been  used  as  a  basis  for 
this  analysis.  The  relatively  uniform  sampling  program  designs  used  by 
the  NES  provide  data  which  are  suitable  for  statistical  treatment.  One 
drawback,  however,  is  that  under  this  program,  reservoirs  were  typically 
sampled  only  three  times  during  one  growing  season.  In  Phase  II  of  this 
project,  there  are  plans  to  examine  data  from  other  agencies,  which,  in 
many  cases,  are  more  intensive  and/or  cover  longer  periods. 

103.  Surface  total  phosphorus,  Secchi  depth,  and  chlorophyll-a 
values  have  been  expressed  in  terms  of  Carlson's  Trophic  State  Indices 
(1^,  IT  and  1^,  respectively)^: 

Ip  =  4.2  +  33.2  log1QP  ,  (26) 

I  =  60  -  33.2  log  Z  (27) 

T  ^10  s 

I  =  30.6  +  22.6  log  B  (28) 

D  IV J 

where 

P  =  total  phosphorus  concentration  (mg/m^) 

Z  =  Secchi  depth  (m) 

s  3 

B  =  chlorophyll-a  concentration  (mg/m  ) 

The  indices  are  calibrated  such  that  the  three  versions  are  equivalent, 
on  the  average,  when  applied  to  midsummer,  epilimnetic  data  from  northern, 
natural  lakes.  Expression  of  measurements  on  the  above  scales  tends  to 
reduce  the  skewness  in  the  distributions  of  the  variables  and  provides 
benchmarks  for  assessing  reservoir  trophic  state  relationships  in  com¬ 
parison  to  those  typical  of  natural  lakes.  A  statistical  data  summary  is 
given  in  Table  41. 
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Table  41 

Statistical  Summary  of  EPA/NES  Trophic  Index 
Data  from  108  CE  Projects 

standard 


Variable 

n 

mean 

deviation 

minimum 

maximum 

XP 

1421 

55.8 

13.1 

24.1 

99.0 

1493 

58.9 

12.7 

25.4 

112.9 

In 

1505 

48.2 

10.3 

o 

• 

00 

81.6 

1 


Case  Studies  of  Spatial  Relationships 

104.  Spatial  variations  typical  of  a  few  reservoirs  are  depicted 
in  Figures  7-10.  Mainstem  stations  are  displayed  in  downstream  order 
within  each  reservoir  (not  to  scale)  and  mean  values  are  plotted  for 
each  version  of  the  Trophic  State  Index.  These  plots  provide  initial 
perspectives  on  the  types  of  spatial  trends  and  relationships  which  can 
be  found  in  reservoirs  and  are  important  supplements  to  the  more  formal 
statistical  treatment  of  the  data  presented  in  subsequent  sections.  The 
plots  seem  to  illustrate  a  number  of  important  controlling  processes, 
which  are  discussed  below  in  relation  to  each  reservoir. 

105.  Figure  7  contains  data  from  the  White  River  system  on  the 
Arkansas/Missouri  border.  Four  reservoirs  are  connected  in  series: 
Beaver,  Table  Rock,  Taneycomo,  and  Bull  Shoals.  With  the  exception  of 
Taneycomo,  they  are  all  deep  reservoirs  with  hydraulic  residence  times 
in  excess  of  250  days.  Trophic  State  Index  behavior  in  the  First  reser¬ 
voir,  Beaver,  is  considerably  different  from  that  typical  of  the  down¬ 
stream  impoundments.  Concurrent  reductions  in  I  and  IT  most  likely 
reflect  sedimentation  and  the  three  index  curves  do  not  converge  until 
the  dam.  Once  most  of  the  sediment  load  has  been  removed  in  Beaver, 
agreement  among  the  index  curves  is  good  at  most  downstream  stations. 
Increases  in  Table  Rock  probably  reflect  the  influence  of  a  major  point 
source  which  accounts  for  more  than  70%  of  the  total  phosphorus  loading 
to  that  reservoir.  The  relatively  low  values  of  I  in  Taneycomo  can  be 
explained  by  the  direct  influence  of  subsurface  discharge  from  upstream 
Table  Rock  Dam.  Taneycomo  has  a  short  hydraulic  residence  time  (7  days) 
and  surface  water  temperatures  at  the  times  when  summer  chlorophyll-a 
samples  were  taken  roughly  matched  temperatures  at  the  100-foot  level 
just  above  Table  Rock  Dam  (~15°C) .  Taneycomo' s  short  hydraulic  residence 
time  is  apparently  insufficient  to  permit  recovery  of  temperatures  and 
algal  populations  from  those  typical  of  the  Table  Rock  hypolimnion. 
Decreases  in  all  versions  of  the  index  are  evident  moving  downstream  in 
Bull  Shoals,  and  relatively  stable  conditions  are  reached  over  the  last 
four  stations. 
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Figure  7  White  River  System 

WHITE  RIVER  SYSTEM 
BEAVER  TABLE  ROCK  T.C.  BULL  SHOALS 


DOWNSTREAM  ORDER  -* 
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Figure  8  Sakakawea 

SAKAKAWEA  (t=584  days) 


106.  Lake  Sakakawea  (Figure  8) ,  on  the  Missouri  River  in  Montana, 
shows  TSI  variations  over  40  units,  or  sixteenfold  differences  in  trans¬ 
parency  and  total  phosphorus.  Upstream  portions  were  classified  by  the 
EPA/NES  as  "hyper-eutrophic"  and  stations  near  the  dam  as  "oligotrophic" . 
Below  the  second  station,  the  chlorophyll-a  index  follows  but  remains 
roughly  5-10  index  units  below  the  phosphorus  and  transparency  indices. 

107.  Old  Hickory  (Figure  9) ,  on  the  Cumberland  River  in  Tennessee, 
shows  relatively  small  downstream  reductions  in  phosphorus  and  trans¬ 
parency  and  a  steady  increase  in  chlorophyll-a.  With  a  mean  residence 
time  of  11  days  and  sediment  removal  by  upstream  reservoirs  on  the 
Cumberland  River,  longitudinal  variations  in  I  and  1^  are  not  as  evident 
as  in  above  panels.  The  gradual  increase  in  I_  might  be  a  hydraulic 
residence  time  effect,  i.e.,  the  time  scale  required  for  algal  popula¬ 
tions  to  "equilibrate"  with  available  nutrient  and  light  levels  may  be 
appreciable  in  relation  to  time-of-travel  within  the  pool. 

108.  Similar  behavior  is  evident  in  Lake  Barkley  (Figure  10) , 
which  is  located  further  downstream  on  the  Cumberland  River  and  has  a 
residence  time  of  16  days.  One  important  difference  is  that  the  phos¬ 
phorus  index  remains  consistently  higher  than  the  other  versions.  Both 
ambient  available  nutrient  concentrations  and  bioassays  indicate,  how¬ 
ever,  that  algal  populations  in  Lake  Barkely  were  nitrogen-limited  at 
the  times  of  sampling.  Use  of  Ip  alone  as  a  measure  of  nutrient  avail¬ 
ability  may  not  be  appropriate  in  this  case. 

109.  The  above  case  studies  illustrate  a  number  of  factors  which 
can  be  important  in  assessing  reservoir  trophic  state  relationships: 
sedimentation,  upstream  impoundment  effects,  hydraulic  residence  time 
effects,  and  nitrogen  limitation.  Reservoir  hydrodynamics  partially 
determine  the  transformations  of  these  and  other  factors  into  spatial 
variations  in  the  trophic  state  indicators.  Upstream/ downstream  varia¬ 
tions  contain  information  on  rates  and  directions  of  controlling  processes. 
Graphing  of  spatial  relationships  and  expression/analysis  in  terms  of 
distance  (river  mile)  and/or  time  (time-of-travel)  will  aid  in  quanti¬ 
tative  data  analysis  and  interpretation.  Use  of  station  means  rather 


than  reservoir  means  seems  to  make  more  sense  for  model  testing  purposes. 


Seasonal  Relationships 

110.  Average  seasonal  variations  in  the  index  components  are  de¬ 
picted  in  Figure  11.  Station  means  have  been  computed  and  their  effects 
removed  from  the  data  prior  to  calculating  the  mean  and  standard  error 
for  each  month  (March-November)  and  index  component.  Analyses  of  vari¬ 
ance  indicate  that  monthly  effects  are  significant  (p <  .0001)  for  each 
component  and  strongest  in  the  case  of  chlorophyll-a .  The  seasonal 
variations  depicted  in  Figure  11  are  characteristic  of  this  collection 
of  reservoirs  but  not  necessarily  of  each  reservoir  individually. 

111.  Average  seasonal  effects  on  phosphorus  and  transparency  are 
similar:  both  tend  to  be  lowest  during  March  and  midsummer  and  highest 
during  April  and  November,  possibly  reflecting  seasonal  flow  and  turbid¬ 
ity  variations  and  influences  of  turnover  periods.  Monthly  effects  on 
chlorophyll-a  suggest  a  spring  maximum  (April-May) ,  followed  by  a  June 
depression,  a  midsummer  maximum,  and  lower  values  in  November.  Tempera¬ 
ture  and  light  effects  may  be  responsible  for  the  relatively  low  chloro¬ 
phyll-a  levels  during  March  and  November.  The  June  depression  may  be 
due  to  seasonal  succession  of  algal  species.  A  more  detailed  examination 
of  the  data  indicates  that  lower  June  chlorophyll-a  levels  are  character¬ 
istic  of  about  half  of  the  stations  sampled  in  June,  while  the  rest  have 
June  levels  more  typical  of  May  or  July  values.  In  testing  seasonal 
aspects  of  TSI  behavior,  Carlson30  also  noted  a  June  depression  in  chloro¬ 
phyll-a  index  relative  to  the  phosphorus  index  in  three  natural  lakes. 

112.  Differences  among  various  versions  of  the  index  provide  a 
measure  of  "lake-like"  behavior,  since  the  index  system  is  calibrated  so 
that  IQ,  lT,  and  Ip  values  are  equivalent,  on  the  average,  when  applied 
to  midsummer,  epilimnetic  data  from  northern,  natural  lakes.  Figure  11 
indicates  that  the  range  of  index  means  is  generally  lowest  during  mid¬ 
summer  (approaching  5  during  August)  and  highest  during  March,  June,  and 
November  (approaching  15) .  Minor  recalibration  of  the  phosphorus  and/or 
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transparency  index  would  bring  I  and  IT  into  agreement  for  all  seasons, 
since  the  monthly  effect  curves  in  Figure  12  are  roughly  parallel.  Since 
seasonal  chlorophyll-a  behavior  is  fundamentally  different,  however, 
recalibration  alone  would  not  eliminate  biases  (i.e.,  significant  dif¬ 
ferences  between  I  and  I  or  between  I  and  I_)  for  all  seasons. 

BP  B  I 

113.  Thus,  seasonal  factors  must  be  considered  in  reducing  the 
data  and  in  recalibrating/redesigning  the  index  system  for  use  in  reser¬ 
voirs.  One  approach  would  be  to  restrict  averaging  period  to  midsummer. 
An  alternative  would  be  to  include  explicit  seasonal  effect  terms  in  the 
model  or  index  system.  These  approaches  will  be  investigated  in  future 
model  development  work. 


Variance  Component  Estimation 

114.  Two-way  Analyses  of  Variance  have  been  applied  to  each  reser¬ 
voir  and  index  component  to  test  for  the  significance  of  variations 
associated  with  station  and  sampling  date.  Results  are  summarized  in 
Table  42.  Significant  (p  <•  .1)  differences  among  station  means  in  Ip , 

I_,  and  I  were  found  in  46,  37,  and  52  out  of  105  projects,  respectively. 
Significant  differences  among  date  means  were  found  in  62,  67,  and  64 
projects,  respectively. 

115.  The  following  ANOVA  model  has  been  employed  to  derive  pooled 
estimates  of  the  variance  components  of  each  version  of  the  index: 


rhi  j 


y  +  r,  +  s,  .  +  t  +  e 

h  hi  h]  hi] 


(29) 


where 


y,  . .  =  index  measurement  in  reservoir  h  at  station  i  on  date  j 
hi] 

u  =  grand  mean 

r^  =  average  effect  of  reservoir  h  on  grand  mean 

s,  .  =  effect  of  station  i  in  reservoir  h 
hi 

t,  .  =  effect  of  date  j  in  reservoir  h 
h] 

e,  . .  =  random  effect 
hi] 


117 


Table  42 

Summary  of  ANOVA  Results 

Station  Effects 

Date  Effects  Not  Significant  Significant** 

Phosphorus  Index  - 

* 

Not  Significant  32  11 

Significant  27  35 

Chlorophyll-a  Index  - 

Not  Significant  33  5 

Significant  35  32 

Transparency  Index  - 

Not  Significant  27  14 


Significant 


26 


38 


The  relative  magnitudes  of  the  within-  and  between- reservoir  components 

are  of  special  significance  to  monitoring  and  modelling  efforts.  With 

relatively  large  within-reservoir  components,  it  would  be  difficult  to 

obtain  much  accuracy  in  reservoir  summary  statistics  (e.g.,  reservoir 

mean)  with  limited  data.  This  would  reduce  the  explainable  variance 
2 

(R  )  of  any  model  calibrated  to  the  reduced  data  set,  make  it  more  dif¬ 
ficult  to  distinguish  among  alternative  model  formulations,  and  increase 

the  error  associated  with  model  parameter  estimates. 

2 

116.  SAS  procedures  have  been  used  to  estimate  the  above  variance 
components  for  each  index  separately  and  to  estimate  analogous  covariance 
components  for  each  pair  of  indices  (Ig/IT,  Ig/I  ,  I^/Ip).  Results  are 
shown  in  Figure  12.  The  phosphorus  and  transparency  index  components 
exhibit  similar  patterns:  between-reservoir  differences  account  for 
60-66%  of  the  total  index  variance,  as  compared  with  29%  in  the  case  of 
chlorophyll-a  index.  Between-reservoir  variances  indicate  that  differ¬ 
ences  in  chlorophyll-a  are  considerably  less  marked  than  would  be 
predicted  based  upon  differences  in  the  phosphorus  or  transparency  indices. 
Conversely,  there  is  greater  temporal  and  random  variance  in  chlorophyll-a 
than  in  phosphorus  or  in  transparency. 

117.  The  covariance  components  on  the  right-hand  side  of  Figure  12 
provide  some  insights  into  relationships  among  the  indices  at  different 
averaging  levels.  Tne  between-reservoir  and  between-station  covariance 
components  are  positive  in  all  cases.  Thus,  the  various  versions  of  the 
index  correlate  positively  both  among  reservoirs  and  among  stations 
within  reservoirs.  Temporal  components  indicate  a  positive  covariance 
for  phosphorus/transparency  but  slightly  negative  covariances  for  the 
pairs  involving  chlorophyll-a.  Thus,  when  temporal  variations  a;  a 
given  station  are  analyzed,  one  would  expect,  on  the  average,  to  find 

a  positive  correlation  only  between  the  phosphorus  and  transparency 
indices.  This  correlation  might  be  attributed,  for  exaitple,  to  turbidity 
variations  following  seasonal  or  short-term  (storm-event)  flow  variations. 
Despite  its  positive  covariance  between  reservoirs  and  between  stations, 
chlorophyll-a  does  not  covary  temporally  with  the  other  indices. 


. At 
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118.  The  EPA/NES  data  base  includes  measurements  from  one  growing 
season  within  any  reservoir  and  does  not  permit  testing  for  between-year 
variance  or  covariance  components.  Thus,  it  is  not  possible  with  this 
data  set  to  test  for  year-to-year  covariance  between  chlorophyll-a  and 
phosphorus  or  transparency.  Distinguishing  between  seasonal  and  year- 
to-year  variance  components  will  be  possible  with  an  expanded  data  base 
including  data  from  other  agencies  and  monitoring  programs. 


Error  Analysis 


119.  The  above  analyses  demonstrate  that  within-reservoir  varia¬ 
tions  cannot  be  considered  random  with  respect  to  dates  or  stations,  the 
primary  parameters  used  in  monitoring  program  design.  This  has  impli¬ 
cations  for  estimating  the  accuracy  of  reservoir  or  station  mean  values 
calculated  from  the  data  set.  If  variations  were  random  and  serially 
independent  with  respect  to  station  and  date,  the  following  statistic 
would  be  appropriate  for  estimating  the  variance  of  a  reservoir  mean: 


Var(y)  = 


Var (y) 


where 

N  =  total  number  of  observations  within  a  reservoir  for  a 
given  index 

33 

Using  two-stage  sampling  theory  ,  the  following  formula  is  more  appro¬ 


priate  : 


2  2  2 

ai.  a 

Var  (y)  =  —  +  —  +  ~ 
n  nL  N 

s  t 


where 

2 

o  =  mean  squared  deviation 

n  =  number  of  stations  sampled 
s 

n  =  number  of  dates  sampled 

The  first  term  accounts  for  the  influence  of  between-station  varia¬ 
tions,  the  second  for  between-date  variations,  and  the  third  for  random 


variations.  Note  that  equation  (31)  reduces  to  equation  (30)  when  spatial 

2  2  2 

and  temporal  variations  are  insignificant  (os  =  ot=0  and  Var(y)=a^). 
Because  both  spatial  and  temporal  variations  often  exhibit  patterns  or 
trends  (see  Figures  7-11) ,  they  cannot  be  considered  serially  independent. 
Thus,  equation  (31)  provides  error  variance  estimates  which  are  approximate 
but  useful  for  assessing  the  relative  contributions  of  spatial  and  tem¬ 
poral  variance  to  the  variance  of  reservoir  means. 

120.  Based  upon  the  total  number  of  observations,  stations,  and 
reservoirs  in  the  NES  data  set,  the  "typical"  reservoir  survey  program 
covers  5  stations  (ng)  on  3  dates  (n^.)  ,  for  a  total  of  15  observations 
(N) .  Using  these  values  and  the  pooled  variance  cc.nponents  depicted  in 
Figure  5,  equations  (30)  and  (31)  have  been  applied  to  each  version  of  the 
index  and  results  are  displayed  in  Figure  13. 

121.  Considering  the  effects  of  spatial  and  temporal  variance 
components  (equation  (31))  increases  mean  error  by  about  a  factor  of 
three  over  estimates  derived  from  equation  (30).  Most  of  the  error  vari¬ 
ance  is  due  to  the  temporal  component,  especially  in  the  case  of  the 
chlorophyll-a  index.  The  error  variances  indicate  that  the  EPA/NES 
sampling  strategy  has  provided  estimates  of  reservoir  geometric  means 
which  are  typically  accurate  (p <  .05)  to  within  factors  of  1.6,  2.2,  and 
1.7  for  surface  phosphorus  concentration,  chlorophyll-a  concentration, 
and  Secchi  depth,  respectively. 

Monitoring  Implications 

122.  Error  analyses  can  be  used  to  improve  upon  monitoring  program 
designs.  For  example,  given  the  objective  of  collecting  data  to  be  used 
in  estimating  a  reservoir  mean  witli  minimum  variance,  the  above  results 
suggest  that  an  increase  in  sampling  dates  would  be  more  effective  than 
an  increase  in  sampling  stations,  because  the  date  effect  term  dominates 
the  error  equation.  Since  the  variance  component  estimates  have  been 
pooled,  these  results  apply  to  this  collection  of  reservoirs  as  a  whole 
and  not  necessarily  to  each  reservoir.  The  same  approach  could  be 
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applied  using  parameters  estimated  for  each  reservoir  individually  (n  , 

2  2  2  S 
n  ,  N,  0  ,  a  ,  a  ).  "Optimal"  designs  could  be  formulated  based  upon 
t  s  t  e 

the  error  analysis  framework  and  upon  functions  which  relate  ng,  nt,  and 
N  to  monitoring  cost.  Given  variance  components  estimated  from  prior 
monitoring  data,  improvements  in  program  design  (changes  in  ng,  nfc,  and 
N)  for  a  given  reservoir  could  be  formulated  to  yield  minimum  error  for 
a  fixed  total  cost  or  minimum  cost  for  a  fixed  total  error.  The  approach 
could  be  expanded  to  include  depth  as  a  third  sampling  dimension.  This 
represents  a  logical  application  for  the  error  analysis  framework  in  a 
monitoring  context. 


123.  In  evaluating  models,  differences  between  observations  and 
predictions  can  be  attributed  to  three  types  of  error:  parameter  error, 
data  error,  and  model  error.  The  first  reflects  uncertainty  in  the  model 
coefficients,  the  second,  errors  in  the  predicted  and/or  predictor  vari¬ 
ables,  and  the  third,  influences  of  factors  which  are  not  considered  in 
the  model  structure.  Analyses  of  the  type  conducted  above  can  be  used 

to  quantify  potential  data  errors  and  separate  them  from  the  other  com¬ 
ponents.  This  provides  insights  into  the  adequacy  of  a  data  base  for  use 
in  model  testing.  If,  for  example,  the  data  error  component  dominates, 
it  would  be  difficult  to  distinguish  among  alternative  models  or  to 
improve  upon  them  without  first  improving  the  data  base. 

124.  Table  43  summarizes  results  of  regression  analyses  which 
have  been  done  to  summarize  relationships  among  the  three  versions  of 
the  index  using  station  mean  and  reservoir  mean  values.  Results  further 
demonstrate  the  need  for  recalibration  of  Carlson's  index  system  in 
applications  to  reservoirs.  The  sensitivity  of  the  chlorophyll-a  com¬ 
ponent  to  phosphorus  or  transparency  ranges  from  .30  to  .37  and  is  in 
all  cases  significantly  different  from  1.0,  the  value  obtained  when  the 
index  system  is  calibrated  to  natural  lakes.  In  contrast,  the  sensi¬ 
tivity  of  I  to  I  is  .88  for  stations  means  and  .91  for  reservoir  means. 
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Table  43 

Regression  and  Corresponding  Error  Analyses 


***  Total** ***  Sampling*  Sampling  Error 

Model  r  s,  Error  Error  Total  Error 

_  b 


Station 

i  Means  (n=484) - 

rB  = 

28.7  +  .35  Ip 

.55 

.024 

42.3 

23.1 

.55 

30.0  +  .31  I 

T 

.46 

.027 

47.6 

22.7 

.48 

ZP  = 

4.0  +  .88  I 

T 

.83 

.027 

47.6 

25.9 

.54 

XB  = 

28.0  +  .37  Ip 

.59 

.048 

32.5 

16.3 

.50 

XB  = 

31.0  +  .30  Im 

T 

.44 

.059 

41.0 

16.0 

.39 

XP  = 

2.0  +  .91  T 

T 

.83 

.059 

41.0 

22.1 

.54 

*  Sampling  Error  due  to  error  in  estimating  mean  index  values  within 
each  station  or  reservoir;  sampling  errors  for  station  mean  phosphorus, 
chlorophyll-a  ,  and  transparency  indices  are  estimated  at  14.3,  21.3, 
and  15.0,  respectively;  corresponding  sampling  errors  for  reservoir- 
means  are  11.2,  14.8,  and  13.1. 


**  Total  Error  =  mean  squared  residual 

***  s.  =»  standard  error  of  regression  slope 
o 
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While  the  correlation  coefficients  (and  explained  variance)  are  con¬ 
siderably  lower  for  predicting  chlorophyll-a  (.44-. 59)  than  for  predic¬ 
ting  phosphorus  from  transparency  (.83),  the  mean  squared  prediction 
errors  are  similar. 

i25.  Part  of  the  mean  squared  error  for  each  regression  model  can 
be  attributed  to  sailing  error  in  estimating  the  reservoir  and  station 
mean  values.  The  sampling  error  component  of  the  mean  squared  error  for 
each  model  is  estimated  from: 

Var(y-bx)  -  Var(y)  +  b^Var(x)  (32) 


where , 

y  =  predicted  index 
x  =  predictor  index 
b  =  slope  of  regression  model 

Results  of  error  analyses  have  been  used  to  estimate  sampling  errors  in 
station  mean  and  reservoir  mean  values.  Results  indicate  that  roughly 
half  (39-54%)  of  the  residual  standard  error  can  be  attributed  to  sampling 
error  in  the  data  values.  The  remaining  variance  presumably  reflects 
model  error,  or  the  effects  of  factors  (e.g. ,  season,  sediment)  which 
are  not  accounted  for  in  the  model.  The  choice  of  averaging  method 
(stations  vs.  reservoirs)  does  not  influence  the  model  coefficient  values 
significantly.  Mean  squared  errors  are  reduced  when  reservoir  mean 
values  are  used,  partially  resulting  from  a  reduction  in  the  sampling 
error  component.  The  standard  errors  of  the  regression  slopes,  however, 
increase  roughly  twofold  when  reservoir  mean  values  are  used  in  place 
of  station  means.  Thus,  using  station  means  permits  better  definition 
of  model  coefficients. 

126.  The  intent  of  the  regression  analyses  presented  in  Table  43 
is  to  demonstrate  the  error  analysis  approach  and  identify  influences 
of  data  reduction  method.  The  models  suggest  that  chlorophyll/phosphorus 
and  chlorophyll/transparency  relationships  in  these  reservoirs  are  sig¬ 
nificantly  different  from  those  which  are  typical  of  natural  lakes. 
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The  models  are  inadequate  for  predictive  use,  however,  because  a  number 
of  important  factors  have  not  been  considered,  including  season,  nitrogen 
limitation,  hydraulic  residence  time,  region,  and  external  sediment 
loading.  These  and  other  factors  will  be  considered  in  developing  a 
Trophic  State  Index  system  applicable  to  reservoirs  under  Phase  II  of 
this  project. 


Conclusions 


127.  Statistical  models  of  index  (or  water  quality)  variations 
within  reservoirs  have  been  shown  to  be  complicated  by  the  effects  of 
spatial  and  temporal  variations  and  by  non-randomness  or  serial  depen¬ 
dence  in  these  variations.  The  above  analyses  have  demonstrated  how 
some  of  these  influences  can  be  approximately  treated  and  applied  to 
assess  data  adequacy  for  computing  reservoir  means  and  to  improve  upon 
monitoring  program  designs.  A  more  thorough  statistical  treatment 
would  require  more  intensive  data  sets  and  involve  the  construction  of 
more  complex  statistical  models  applied  separately  to  each  reservoir 
using  simulation  techniques.  This  level  of  detail  is  not  justified  or 
feasible  within  the  context  of  this  study. 

128.  The  following  general  conclusions  can  be  drawn  from  the 
analyses  of  EPA/NES  data  in  previous  sections: 

a.  Spatial  relationships  among  trophic  state  indicators  are 
important  in  some  reservoirs,  especially  when  stations 
are  viewed  in  downstream  order. 

b.  Analyses  of  Variance  indicate  that  within-reservoir  vari¬ 
ations  are  often  non-random  with  respect  to  stations 
and/or  sampling  dates  within  a  given  year.  Variations 
with  respect  to  date  are  typically  stronger  than  varia¬ 
tions  between  stations,  particularly  in  the  case  of 
chlorophyll-a'. 

c.  Some  of  the  temporal  variance  within  reservoirs  and 
stations  is  fixed  with  respect  to  month  or  season. 
Seasonal  effects  on  phosphorus  are  qualitatively  similar 
to  seasonal  effects  on  transparency  but  differ  from 
seasonal  effects  on  chlorophyll-a. 
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d.  Between-reservoir  variance  in  Carlson's  chlorophyll-a 
index  is  roughly  one  third  of  the  between-reservoir 
variances  in  the  phosphorus  and  transparency  indices. 

<2.  For  this  data  set,  chlorophyll-a  variance  between 

sampling  dates  within  reservoirs  is  greater  than  its 
variance  between  reservoir  means.  The  reverse  is  true 
for  phosphorus  and  transparency. 

_f.  Covariance  components  indicate  significant  positive  cor¬ 
relations  among  the  three  versions  of  the  index  system 
when  variations  between  reservoirs  and  between  stations 
within  reservoirs  are  considered.  The  chlorophyll-a 
index  does  not  correlate  with  either  of  the  other  indices, 
however,  when  temporal  variations  (at  a  given  station  or 
within  a  given  reservoir)  are  considered. 

Because  of  non-randomness  with  respect  to  stations  and 
dates,  the  error  variance  of  reservoir  mean  values  for 
each  index  are  typically  three  times  those  estimated  from 
the  familiar  formula  for  mean  variance  (a2/n) . 

h.  Given  the  objective  of  estimating  reservoir  means  with 
minimum  variance,  error  analyses  indicate  that  increases 
in  the  number  of  sampling  dates  would  be  generally  more 
effective  than  increases  in  the  number  of  stations  per 
reservoir  for  improvement  in  the  EPA/NES  sampling  design. 

i^.  Regression  analyses  indicate  that  chlorophyll-a  levels 
are  significantly  less  sensitive  to  phosphorus  or  trans¬ 
parency,  as  compared  with  the  natural  lakes  used  in 
developing  Carlson's  index  system.  Future  development 
of  an  index  system  for  reservoirs  should  consider  the 
effects  of  season,  region,  nitrogen  limitation,  residence 
time,  and  sediment  loading. 

j_.  Use  of  station  means,  as  opposed  to  reservoir  means,  in 
recalibrating  the  index  system  causes  small  increases  in 
standard  error  of  estimate  but  substantially  reduces  the 
standard  errors  of  model  parameters. 


Implications  for  Data  Reduction  Strategies 


129.  The  conclusions  in  the  previous  section  suggest  that  the  fol¬ 
lowing  data- reduction/analytical  strategies  be  used  in  testing  and 
developing  models  under  Phase  II  of  this  project: 

a.  Since  spatial  variations  and  trends  have  been  shown  to  be 
often  statistically  identifiable  and  useful  for  interpreta¬ 
tion  purposes,  use  of  station  means  would  seem  to  be  more 
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desirable  than  use  of  reservoir  means  for  model  testing. 

Use  of  station  means  would  also  permit  better  definition 
of  model  parameters. 

b.  It  would  be  useful  to  develop  a  scheme  for  spatial  orien¬ 
tation  of  stations  within  each  reservoir  with  respect  to 
major  tributary  (arm)  and  distance  (river  mile) .  Some 
aggregation  of  stations  based  upon  proximity  may  be 
feasible  within  this  framework. 

c.  Seasonal  factors  should  be  considered  in  averaging  data 
within  each  station  or  station  group.  This  would  involve, 
for  example,  estimation  of  "spring”,  "summer",  "fall",  as 
well  as  "growing  season"  and  "annual"  averages. 

d.  For  non-NES  stations  which  are  sampled  during  more  than 
one  year,  tests  for  significant  year-to-year  differences 
should  be  made  and  used  to  decide  whether  aggregation  of 
data  from  different  years  is  justifiable. 

e_.  While  the  above  analyses  have  been  based  exclusively  upon 
surface  sampled  for  phosphorus,  averaging  with  depth 
should  be  considered,  at  least  within  the  photic  zone, 
for  testing  of  relationships  among  phosphorus,  chloro- 
phyll-a,  and  transparency. 

£_.  To  permit  testing  of  loading  response  models,  seasonal 

estimates  of  reservoir  means  could  be  derived  by  averaging 
across  stations.  Due  to  longitudinal  variations  in  many 
reservoirs,  however,  near-dam  or  discharge  conditions 
should  also  be  used  as  bases  for  loading  model  evaluations. 
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PART  XIV:  EVALUATION  OF  METHODS  FOR  ESTIMATING  PHOSPHORUS  LOADINGS 


Introduction 

130.  The  estimation  of  reservoir  nutrient  budgets  entails  estima 
tion  of  the  total  mass  of  nutrients  passing  given  sampling  points  over 
a  given  period  of  time,  typically  at  least  one  year.  While  continuous, 
flow-weighted  composite  sampling  for  concentrations  would  provide  the 
best  basis  for  deriving  such  estimates,  usually  only  periodic  grab- 
samples  are  available  for  stream  concentrations.  These  concentrations 
must  be  integrated  with  flow  records  (typically  continuous)  in  order  to 
estimate  the  desired  loadings.  The  purpose  of  this  section  is  to  test 
and  compare  alternative  methods  in  order  to  provide  some  guidance  for 
future  nutrient  budget  calculations  on  CE  reservoirs. 


Preliminary  Analysis 


131.  The  relationship  between  concentraton  and  flow  influences 

the  appropriateness  of  a  given  loading  calculation  method  at  a  given 

station.  A  subset  of  flow  and  phosphorus  concentration  data  from  the 

current  CE  data  base  has  been  analyzed  in  order  to  provide  some  perspec 

34 

tive  on  this  relationship  .  The  subset  includes  86  tributary  and  33 
discharge  stations,  each  with  at  least  25  total  phosphorus/streamflow 
pairs  obtained  from  STORET.  Table  44  describes  the  symbols  and  funda¬ 
mental  equations  used  to  characterize  flow  and  concentration  variations 

132.  Results  of  the  preliminary  analysis  are  given  in  Table  45, 

with  data  grouped  by  station  type.  The  regression  model  relating  the 

logarithm  of  concentration  to  the  logarithm  of  flow  explains,  on  the 

average,  12.3%  of  the  variance  in  concentration  at  tributary  stations 

and  6.6%  of  the  variance  in  concentration  at  discharge  stations. 

2 

Figure  14  shows  that  the  upper  percentiles  of  the  R  distribution  at 
tributary  sites  are  roughly  twice  those  found  at  discharge  sides. 
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Table  44 


Fundamental  Equations  and  Symbols 

Loading  Definition:  L  =  Q  C 

W  «  X  +  Y 

Flow/Concentration  Model :  C  =  a  Q*3 

Y  -  log^a  +  b  X 

Distributions:  Y : (mean=  u  ,  std.dev.=  a  ) 

-  y  y 

X: (mean=  u  ,  std.dev.=  a  ) 
x  x 

Symbols :  L  »  Loading  (mass/time) 

Q  =  Flow  (volume/time) 

C  *  Concentration  (mass/volume) 
W  =  log10(L) 

X  »  log10(Q) 

V  -  log10(C) 

a,b  =  Regression  model  parameters 
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Table  45 

Preliminary  Analysis  of  Flow/Total  P  Concentration  Relationships 


Station  Type 


Statistic 


Tributary 


Discharge 


k; 

k  ' 

number  of  stations 

86 

33 

h 

fraction  of  variance  explained 
by  regression  model 

.123 

*  .151* 

.  066 

+ 

.078 

V 

residual  variance 

.101 

t  .075 

.090 

+ 

.050 

serial  correlation  of  residuals 

.228 

t  .205 

.300 

+ 

.276 

i. 

conc/flow  sensitivity  (b) 

.124 

+  .250 

.097 

+ 

.138 

. 

« 

standard  dev.  of  log  ( flow) 

.573 

t  .246 

.600 

+ 

.303 

(• 

standard  dev.  of  log1Q(conc) 

.323 

t  .133 

.297 

+ 

.088 

*  mean  ±  one  standard  deviation 


Figure  14 


PERCENTILE 
Figure  15 

Distributions  of  Regression  Slopes  at  Tributary  and  Discharge  Stations 


133.  Both  the  mean  and  the  standard  deviation  of  the  regression 
slope  (b)  are  larger  in  the  case  of  tributary  stations.  The  distribu¬ 
tions  of  b  are  compared  on  a  normal  probability  scale  in  Figure  15. 

While  the  medians  are  nearly  equal,  the  upper  percentiles  are  higher 
and  the  lower  percentiles  are  lower  in  the  case  of  tributary  stations. 

The  wide  distribution  of  b  values  across  stations  suggests  that  loading 
calculation  methods  must  be  capable  of  accounting  for  alternative  flow/ 
concentration  relationships. 

134.  These  results  indicate  that  concentration  tends  to  be  more 
flow-dependent  at  upstream  tributary  stations  than  at  discharge  stations. 
Reservoir  pools  may  buffer  variations  associated  with  runoff  events 
(which  would  tend  to  produce  high  b  values)  or  point  source  discharges 
(which  would  tend  to  produce  low  b  values) .  The  fact  that  streamflow 

is  highly  regulated  at  reservoir  discharge  points  may  also  contribute 
to  weaker  flow/ concentration  relationships.  The  serial  correlation  of 
residuals  (i.e.,  concentration,  after  the  effect  of  flow  is  removed) 
tends  to  be  higher  in  the  case  of  discharge  stations  (.30  vs.  .23  on 
the  average) .  This  suggests  that  seasonal  factors  or  long-term  trends 
may  have  greater  influences  at  discharge  points. 

135.  Table  46  lists  means  regression  slopes  by  station  type  and 
component  (dissolved  P,  total  P-dissolved  P,  and  total  P,  respectively). 
This  breakdown  is  based  upon  measurements  of  total  P  and  dissolved  P  at 
52  tributary  and  17  discharge  stations.  For  both  station  types,  the 
mean  slopes  of  dissolved  phosphorus  with  respect  to  flow  are  not  signi¬ 
ficantly  different  from  zero.  In  the  case  of  total  P-dissolved  P,  how¬ 
ever,  the  mean  slope  is  .34  ±  .05  at  tributary  stations  and  .11  ±  .09 

at  discharge  stations,  although  the  latter  is  not  significantly  different 
from  zero.  The  influence  of  streamflow  on  the  transport  of  particulate 
phosphorus  probably  accounts  for  these  results.  The  dissolved  component 
tends  to  be  more  independent  of  flow  than  the  particulate  component. 

The  relative  weakness  of  the  dissolved  phosphorus/flow  relationship  may 
reflect  a  buffering  effect  of  adsorption  chemistry  on  stream  phosphorus 
levels  and/or  the  fact  that  transport  efficiency  for  dissolved  phosphorus 


Table  46 

Concentration/Flow  Sensitivities  by  Component  and  Station  Type 


Component- 


Station 

Type 

Number  of 
Stations 

Total  P  - 
Dissolved  P 

Dissolved  P 

Total  P 

Tributary 

52 

.341  ±  .050* 

.019  ±  .034 

.239  +  .044 

Discharge 

17 

.109  t  .087 

.036  1  .101 

.082  t  .038 

*  mean  +  one 

standard  error  of 

mean 
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is  not  velocity-dependent,  as  in  the  case  of  the  particulate  fraction. 


Estimation  Methods 


136.  Four  algorithms  for  estimating  loadings  have  been  tested: 

a.  average  loading. 

b.  average  concentration  x  average  flow. 

£.  flow-weighted  average  concentration  x  average  flow. 

d.  regression  estimate  based  upon  log  (concentration)  vs. 
log (flow)  relationship. 

These  schemes  are  described  in  Table  47.  a  few  other  types  of  regression 
models  have  also  been  evaluated,  but  none  proved  better  than  the  model 
used  in  Method  4  (see  Table  47) .  Two  approaches  to  evaluating  these 
methods  have  been  taken:  one  involves  subsampling  from  simulated  flow 
and  concentration  data  and  the  other,  subsampling  from  real  flow  and  con¬ 
centration  data.  These  methods  and  results  are  described  below. 

Tests  Based  Upon  Simulated  Data 

137.  Table  48  describes  an  algorithm  used  to  generate  flow  and 
concentration  time  series  using  Monte-Carlo  techniques.  This  method  has 
been  used  to  produce  five  years  of  simulated  daily-average  flow  and  con¬ 
centration  data.  Mean  loadings  have  been  calculated  directly  for  each 
year  and  compared  with  estimates  based  upon  subsampled  taken  at  monthly 
intervals  and  employing  the  calculation  methods  listed  in  Table  47 

For  each  year,  a  total  of  thirty  trials  (sets  of  subsamples)  have  been 
made,  representing  regular  sampling  on  day  1  to  day  30  of  each  month, 
respectively.  Thus,  error  statistics  are  based  upon  a  total  of  150 
trials  for  each  method.  Results  have  been  found  to  be  insensitive  to 
the  number  of  trials.  The  simulation  algorithm  is  not  adequate  for 
evaluation  of  interpolation  methods  (in  which  the  sequence  of  samples 
is  considered) ,  since  no  serial  correlation  or  seasonal  factors  are 
included.  The  parameters  of  the  simulation  model  are  typical  of  the 


Table  47 


Estimation  Methods 


Method  I  :  Average  Loading 

1  Ci 


h.  " 


Method  2  ;  Average  Concentration  x  Average  Flow 


Z  c, 


Method  3  :  Flow- weigh ted  Average  Concentration  x  Average  Flow 


t3  -  Q 


C. 


2  *4 


Method  4  aggression  Estimate 

Eqici  fnQ 


b+1 


S  *4 


where, 

»  flow  on  senile  data  i 
«  concentration  on  sample  date  i 
n  ■  number  of  sample  dates 
b  «  slope  of  log(c)  vs.  log(q)  regression 

Q  -  average  flow  over  entire  period,  based  upon  continuous  flow  record 
£  *  sub  over  all  sample  dates  (i"l  to  n) 


Table  48 


Algorithm  for  Generation  of  Flow/Concentration  Time  Series 


0. 

0. 

.52 
.26 

(-.8  -*•  .8) 

Algorithm; 

xi  -  Ux  +  N(0,l)dx 

Y.  =  vi  +  N(0,l)d  +  (X,  -  y  )b 

i  y  y  i  x 


Input  Values; 


=  mean  of  logarithm  of  flow  = 

p  =  mean  of  logarithm  of  concentration  = 

dx  =  standard  deviation  of  logarithm  of  flow  = 

d  =  standard  deviation  of  logarithm  of  cone  = 

Y  (random  component) 


b  =  slope  of  flow/concentration  model 


Symbol; 

N(0,1)  =  normal  random  deviate  with  mean  zero  and  standard 
deviation  one 


data  distributions  depicted  in  Figure  15  and  Table  45.  A  range  of 
flow/concentration  sensitivities  (b  in  Table  47)  have  been  used  (-.8  to 
+  .8)  . 

138.  The  following  error  statistics  have  been  used  to  compare 
"observed"  and  estimated  loadings  for  method  and  trial: 


-  5  t  (v  * 


(33) 


(34) 


where , 

Bias .  =  bias  associated  with  method  j 
3 

MSEj  =  mean  squared  error  associated  with  method  j 

M  =  total  number  of  trials  =  150 

=  actual  loading  for  trial  k  (mass/time) 

=  estimated  loading  for  method  j  and  trial  k  (mass/time) 

For  each  method,  these  error  statistics  are  plotted  vs.  the  flow/ concen¬ 
tration  sensitivity  statistic  in  Figures  16  and  17,  respectively. 

139.  Only  Method  1  is  unbiased  for  all  values  of  b.  The  MSE  of 
this  method,  however,  is  significantly  higher  than  that  of  the  other 
methods  at  values  of  b  greater  than  -.2.  This  method  is  best  under 
conditions  where  loading  is  relatively  independent  of  flow  (b+-l). 

This  might  be  the  case,  for  instance,  at  a  station  which  is  located 
below  a  major  point  source  of  nutrients,  but  not  where  storm-event  or 
non-point  loads  are  significant. 

140.  The  performance  of  Method  2,  in  which  concentration  and  flow 
are  averaged  independently,  is  a  strong  function  of  b,  both  with  respect 
to  bias  and  variance.  This  method  is  unbiased  only  at  b=0,  but  has 
biases  of  +35%  and  -23%  at  b  values  of  -.2  and  +.2,  respectively.  Trends 
in  bias  continue  moving  toward  more  negative  or  more  positive  b  values. 
The  method  has  a  sharp  minimum  in  MSE  at  b=0.  This  scheme  is  apparently 
best  only  when  concentration  and  flow  are  truly  independent,  but  gives 


139 


REGRESSION  SLOPE  (b) 


substantial  biases  and  MSE's  for  even  weak  tiow/concentration  relation¬ 
ships  • 

141.  Method  3,  which  employs  the  flow-weighted  average  concentra¬ 
tion,  behaves  qualitatively  similar  to  Method  2,  but  is  not  as  sensitive 
to  b  values.  Zero  bias  and  minimum  variance  are  evident  at  b=0.  This 
method  is  analogous  to  a  "ratio  estimate"  used  in  classical  sampling 
theory 

142.  Method  4  has  less  bias  and  variance  than  Method  3  for  most 

b  values.  Unlike  the  other  methods,  the  regression  model  can  adjust  to 

different  types  of  flow/concentration  relationships  and  estimation  errors 

are  less  sensitive  to  b  values.  Bias  becomes  more  significant  at  high 

values  of  b.  At  b=.6,  for  example,  Method  4  underpredicts  loading  by  an 

average  of  15%.  The  MSE  at  this  b  value  is  .13,  which  corresponds  to  a 

standard  error  of  +  36%.  Thus,  bias  is  less  than  one  half  of  one  stan- 

2 

dard  error  and  accounts  for  17%  of  the  MSE  (.15  /.13)  at  b=.6.  Additional 
tests  indicate  that  applying  the  regression  model  separately  to  each 
daily  flow  in  the  year,  a  more  tedious  calculation,  does  not  reduce  the 
bias  or  variance  of  Method  4  at  any  b  value. 

Tests  Based  Upon  Real  Data 

143.  Calculation  methods  have  also  been  tested  using  the  flow  and 
concentration  data  from  the  119  stations  analyzed  above.  Subsamples  of 
12  flow/ concentration  pairs  have  been  taken  at  regular  intervals  from 
each  station.  Loadings  estimated  for  each  subsample  and  method  have 
been  compared  with  loadings  estimated  by  applying  Method  1  to  all  flow/ 
concentration  pairs  available  for  each  station.  A  total  of  508  subsamples 
have  been  taken  from  86  tributary  stations  and  190  subsamples  from  33 
discharge  stations.  Bias  and  MSE  estimates  are  given  in  Table  49  for 
each  method  and  station  type.  These  results  are  approximate  because  of 
errors  involved  in  estimating  actual  loading. 

144.  Generally,  Methods  3  and  4  appear  to  perform  better  than 
Methods  1  or  2  for  both  station  types.  The  MSE  of  Method  3  equals  that 


141 


Table  49 

Results  of  Method  Testing  Using  Real  Flow  and  Concentration  Data 

Estimation  Method* ** 

Statistic  Method  1  Method  2  Method  3  Method  4 

* 

- - - Tributary  Stations  (N=86,M=508) 


Bias 

-.002 

-.027 

-.029 

-.023 

MSE 

.530 

.297 

.176 

.176 

/  \T—  7  7  M- 1  q  n  >  - 

Bias 

.001 

.068 

-.017 

-.000 

MSE 

.272 

.585 

.108 

.156 

*  N  -  number  of  stations,  M  »  number  of  trials 

**  see  Table  47 
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of  Method  4  in  tributary  stations  (.176),  but  is  lower  in  discharge 
stations  (.108  vs.  .156).  When  averaged  over  the  range  of  b  values  in 
the  data  set,  none  of  the  methods  is  appreciably  biased. 

Prior  Error  Estimation 


145.  To  provide  bases  error  analysis  and  assessment  of  data 
adequacy,  a  means  for  estimating  the  variance  of  loading  estimates  de¬ 
rived  from  a  given  set  of  flow/concentration  data  is  needed.  Table  50 
presents  a  formula  appropriate  for  use  with  Method  4  (or  with  Method  3 
when  b=0) .  This  approximate  formula,  derived  from  expected  value  theory, 
has  two  terms:  one  reflecting  variance  around  the  flow/ concentration 
regression  model,  and  one  reflecting  variance  in  the  estimate  of  the 
slope  parameter  b. 

146.  Figure  18  compares  the  observed  mean  squared  estimation 
errors  for  Method  4  with  error  variances  estimated  from  the  formula  in 
Table  50  using  simulated  data.  Reasonable  agreement  between  observed 
and  estimated  variances  is  apparent  over  the  range  of  b  values  typically 
encountered.  Similar  tests  have  been  done  using  real  flow  and  concen¬ 
tration  data.  The  formula  in  Table  50  overestimates  the  error  mean 
square  by  an  average  of  18%  for  tributary  stations  and  underestimates 

the  error  mean  square  by  an  average  of  less  than  1%  at  discharge  stations. 

Conclusions 


147.  Preliminary  data  analysis  has  characterized  the  distributions 
of  flow/ concentration  relationships  in  tributary  and  discharge  streams. 
Methods  3  and  4  are  generally  better  than  Methods  1  or  2  for  estimating 
loadings,  given  the  distribution  of  concentration/f low  sensitivies  en¬ 
countered  at  various  stations  (see  Figure  15) .  Method  3,  which  employs 
the  flow-weighted  average  concentration,  is  actually  a  special  case  of 
Method  4,  with  b=0.  In  calculating  loadings,  it  seems  reasonable  to  use 
a  regression  analysis  to  estimate  the  slope  parameter  b  and  to  use 
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other  symbols  defined  in  Tables  46  and  47 


Method  4  unless  the  slope  estimate  is  not  significantly  different  from 
zero,  in  which  case  Method  3  would  be  used.  The  formula  in  Table  50 
can  be  used  to  derive  approximate  estimates  of  error  associated  with  a 
given  mean  loading  calculated  using  Methods  3  or  4.  Error  variances 
can  be  used,  in  turn,  to  characterize  the  accuracies  of  reservoir  nutrient 
loading  and  discharge  estimates. 


PART  XV:  CONCLUSIONS 


148.  The  compilation,  description,  and  preliminary  analyses  of 
the  data  base  described  in  previous  sections  lead  to  the  following  con¬ 
clusions  : 

a.  Using  primarily  centralized  sources  of  information,  it 
has  been  possible  to  compile  sufficient  data  on  water 
quality,  hydrology,  sedimentation,  and  other  project 
characteristics  to  permit  testing  of  empirical  eutrophi¬ 
cation  models  under  Phase  II  of  this  study. 

b.  The  hydrology  files  need  to  be  augmented  with  monthly 
elevation/contents  data  from  several  districts  and 
projects. 

£.  Information  in  the  water  quality  files  will  not  be  ade¬ 
quate  for  assessing  the  trophic  states  of  all  299  reser¬ 
voirs  in  the  central  project  list.  It  should  be  adequate, 
however,  for  characterizing  about  130  reservoirs  and  for 
testing  empirical  models  relating  within-pool  measures 
of  trophic  state. 

d.  Use  of  data  sources  outside  of  the  EPA/National  Eutro¬ 
phication  Survey  has  generally  more  than  doubled  the 
total  numbers  of  observations  of  various  within-pool 
trophic  state  indicators.  Non-NES  data  are  generally 
more  extensive  temporally. 

£.  Comparisons  derived  from  the  EPA  National  Eutrophication 
Survey  compendium  reveal  significant  differences  between 
lakes  and  reservoirs  in  the  means  of  most  morphometric, 
hydrologic,  and  trophic  state  indicator  variables.  Com¬ 
pared  with  natural  lakes,  reservoirs  have  greater  potential 
nutrient  enrichment  problems,  as  gauged  by  Vollenweider 
phosphorus  loading  models,  but  lower  observed  levels  of 
chlorophyll-a ,  on  the  average.  The  validity  of  existing 
loading  models  in  reservoirs  is  in  question  due  to  lake/ 
reservoir  differences  in  morphometry,  hydrodynamics, 
sedimentation,  and  region.  Because  of  the  relative  geo¬ 
graphical  distributions  of  lakes  and  reservoirs  in  the 
U.  S.,  it  is  difficult  to  distinguish  the  effects  of 
region  from  those  of  impoundment  type. 

_f.  For  the  purposes  of  this  project  reservoir  hypsographic 
curves  can  be  conveniently  summarized  using  low-order 
polynomial  equations  relating  the  logarithm  of  total 
volume  to  the  logarithm  of  total  depth.  Errors  charac¬ 
teristic  of  this  curve-fitting  scheme  are  similar  to 
those  characteristic  of  direct  interpolation  methods. 


i 


Within-reservoir  variations  of  trophic  state  indicators 
generally  cannot  be  considered  random  with  respect  to 
station  and  date,  the  primary  parameters  used  in  monitor¬ 
ing  program  design.  Spatial  variations  contain  informa¬ 
tion  on  rates  and  directions  of  processes  controlling 
eutrophication  within  reservoirs.  Variance  component 
analyses  indicate  that  phosphorus  and  transparency  have 
variance  structures  which  are  similar  to  each  other,  but 
fundamentally  different  from  the  variance  structure  of 
chlorophyll-a.  Implications  of  these  variance  structures 
for  monitoring  and  modelling  efforts  have  been  discussed. 

A  method  for  estimating  the  mean  and  error  variance  of 
nutrient  loadings  derived  from  continuous-flow  and  grab- 
sample-concentration  measurements  has  been  developed  and 
tested  using  real  and  simulated  flow/ concentration  time 
series.  The  method  employs  a  regression  model  relating 
concentration  to  flow  and  has  been  shown  to  compare 
favorably  with  alternative  calculation  methods  with 
respect  to  bias  and  variance  of  loading  estimates  under 
a  range  of  conditions. 


PART  XVI :  RECOMMENDATIONS 


149.  The  following  additional  data  base  development  work  is  re¬ 
quired  in  order  to  permit  testing  of  empirical  eutrophication  models 
and  should  therefore  be  included  in  the  scope  of  Phase  II  of  this  study: 

a.  corqailation  of  monthly  elevation/contents  data  from  many 
districts  and  projects,  as  indicated  by  the  inventories 
in  Appendix  A. 

b.  development  of  a  scheme  for  sorting  of  water  quality 
stations  in  downstream  order  within  reservoirs. 

150.  While  compiled  for  the  specific  purposes  of  this  project,  the 
data  base  described  in  this  report  could  be  adapted  for  other,  more 
generalized  uses.  As  discussed  in  the  introduction  to  this  report,  the 
current  data  base  consists  of  a  number  of  computer  files,  reports,  maps, 
and  data  forms  organized  in  a  consistent  framework.  It  is  not  a  system 
designed  for  frequent  interactive  use.  Given  the  current  data  base,  the 
development  of  such  a  system  would  involve  the  following: 

a.  definition  of  objectives  (desired  scope  of  data  base  uses) . 

b.  definition  of  operating  environment  (direct  users,  main- 
tainence  personnel,  and  computer  system). 

c.  development  of  appropriate  software  for  various  purposes 
(e.g. ,  accessing,  updating,  editing,  summarizing,  dis¬ 
playing,  analyzing) . 

d.  compilation  of  any  additional  data  required  for  intended 
uses  of  the  data  base  (e.g.,  inclusion  of  priority  pollut¬ 
ants,  development  of  a  digital  reservoir  mapping  capa¬ 
bility)  . 

e_.  appropriate  modifications  in  file  designs. 

JF.  establishment  of  channels  and  procedures  for  updating  and 
verification  of  information. 

£.  testing  of  the  system  in  an  operational  environment. 

h.  documentation  of  the  system . 

i^.  orientation  of  potential  users. 

It  is  recommended  that  the  Corps  of  Engineers  consider  expanding  upon  the 
existing  data  base,  as  outlined  above,  in  order  to  permit  more  generalized 
use  of  the  information  which  has  been  compiled  under  this  project.  The 
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APPENDIX  A 

DATA  INVENTORIES  BY  PROJECT  AND  DIVISION 
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